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(54) Title: METHODS FOR DETECTION OF GENETIC DISORDERS 



O (57) Abstract: The invention provides a method useful for detection of genetic disorders. The method comprises determining the 
^ sequence of alleles of a locus of interest, and quantitating a ratio for the alleles at the locus of interest, wherein the ratio indicates the 
Q presence or absence of a chromosomal abnormality. The present invention also provides a non-invasive method for the detection of 
^ chromosomal abnormalities in a -f£tus. The invention is especially useful as a non-invasive method for deiemiimng the sequence of 



fetal DNA. The invention further provides methods of, isolation of free DNA from a sample. 
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METHODS FOR DETECTION OF GENETIC DISORDERS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority to U.S. Patent Application No. 10/093,618, filed March 11, 
2002; provisional U.S. Patent Application Nos. 60/360,232 and 60/378,354, filed March 1, 2002, 
and May 8, 2002, respectively; and to P.C.T. Application No. US03/06198, filed February 28, 
2003. The contents of these applications are hereby incorporated by reference in their entirety 
herein. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 
The present invention is directed to a method for the detection of genetic disorders 
including chromosomal abnormalities and mutations. The present invention provides a rapid, 
non-invasive method for determining the sequence of DNA from a fetus. The method is especially 
useful for detection of chromosomal abnormalities in a fetus including translocations, transversions, 
monosomies, trisomies, and other aneuplodies, deletions, additions, amplifications, translocations 
and rearrangements. 

BACKGROUND ART 

Chromosomal abnormalities are responsible for a significant portion of genetic defects in 
liveborn humans. The nucleus of a human cell contains forty-six (46) chromosomes, which contain 
the genetic instructions, and determine the operations of the cell. Half of the forty-six 
chromosomes originate from each parent. Except for the sex chromosomes, which are quite 
different from each other in normal males, the chromosomes from the mother and the chromosomes 
from the father make a matched set. The pairs were combined when the egg was fertilized by the 
sperm. Occasionally, an error occurs in either the formation or combination of chromosomes, and 
the fertilized egg is formed with too many or too few chromosomes, or with chromosomes that are 
mixed in some way. Because each chromosome contains many genes, chromosomal abnormalities 
are likely to cause serious birth defects, affecting many body systems and often including 
developmental disability (e.g., mental retardation). 

Cells mistakenly can rejoin broken ends of chromosomes, both spontaneously and after 
exposure to chemical compounds, carcinogens, and irradiation. When rejoining occurs within a 
chromosome, a chromosome segment between the two breakpoints becomes inverted and is 
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classified as an inversion. With inversions, there is no loss of genetic material; however, inversions 
can cause disruption of a critical gene, or create a fusion gene that induces a disease related 
condition. 

In a reciprocal translocation, two non-homologous chromosomes break and exchange 
5 fragments. In this scenario, two abnormal chromosomes result: each consists of a part derived from 
the other chromosome and lacks a part of itself. If the translocation is of a balanced type, the 
individual will display no abnormal phenotypes. However, during germ-cell formation in the 
translocation-bearing individuals, the proper distribution of chromosomes in the egg or sperm 
occasionally fails, resulting in miscarriage, malformation, or mental retardation of the offspring. 

10 In a Robertsonian translocation, the centromeres of two acrocentric (a chromosome with a 

non-centrally located centromere) chromosomes fuse to generate one large metacentric 
chromosome. The karyotype of an individual with a centric fusion has one less than the normal 
diploid number of chromosomes. 

Errors that generate too many or too few chromosomes can also lead to disease phenotypes. 

15 For example, a missing copy of chromosome X (monosomy X) results in Turner's Syndrome, while 
an additional copy of chromosome 21 results in Down's Syndrome. Other diseases such as 
Edward's Syndrome, and Patau Syndrome are caused by an additional copy of chromosome 18, and 
chromosome 13, respectively. 

One of the most common chromosome abnormalities is known as Down syndrome. The 

20 estimated incidence of Down's syndrome is between 1 in 1,000 to 1 in 1,100 live births. Each year 
approximately 3,000 to 5,000 children are born in the U.S. with this chromosomal disorder. The 
vast majority of children with Down syndrome (approximately 95 percent) have an extra 
chromosome 21. Most often, the extra chromosome originates from the mother. However, in about 
3-4 percent of people with Down syndrome, a translocation between chromosome 21 and either 14 

25 or 22 is responsible for the genetic abnormality. Finally, another chromosome problem, called 
mosaicism, is noted in about 1 percent of individuals with Down's syndrome. In this case, some 
cells have 47 chromosomes and others have 46 chromosomes. Mosiacism is thought to be the 
result of an error in cell division soon after conception. 

Chromosomal abnormalities are congential, and therefore, prenatal diagnosis can be used to 

30 determine the health and condition of an unborn fetus. Without knowledge gained by prenatal 
diagnosis, there could be an untoward outcome for the fetus or the mother or both. Congenital 
anomalies account for 20 to 25% of perinatal deaths. Specifically, prenatal diagnosis is helpful for 
managing the remaining term of the pregnancy, planning for possible complications with the birth 
process, preparing for problems that can occur in the newborn infant, and finding conditions that 

35 may affect future pregnancies. 
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There are a variety of non-invasive and invasive techniques available for prenatal diagnosis 
including ultrasonography, amniocentesis, chorionic villus sampling (CVS), fetal blood cells in 
maternal blood, maternal serum alpha-fetoprotein, maternal serum beta-HCG, and maternal serum 
estriol. However, the techniques that are non-invasive are less specific, and the techniques with 
5 high specificity and high sensitivity are highly invasive. Furthermore, most techniques can be 
applied only during specific time periods during pregnancy for greatest utility. 

Ultrasonography 

This is a harmless, non-invasive procedure. High frequency sound waves are used to 
10 generate visible images from the pattern of the echoes made by different tissues and organs, 
including the fetus in the amniotic cavity. The developing embryo can be visualized at about 6 
weeks of gestation. The major internal organs and extremities can be assessed to determine if any 
are abnormal at about 16 to 20 weeks gestation. 

An ultrasound examination can be useful to determine the size and position of the fetus, the 
15 amount of amniotic fluid, and the appearance of fetal anatomy; however, there are limitations to this 
. procedure. Subtle abnormalities, such as Down syndrome, where the morphologic abnormalities 
are often not marked, but only subtle, may not be detected at all. 

Amniocentesis 

20 This is a highly invasive procedure in which a needle is passed through the mother's lower 

abdomen into the amniotic cavity inside the uterus. This procedure can be performed at about 14 
weeks gestation. For prenatal diagnosis, most amniocenteses are performed between 14 and 20 
weeks gestation. However, an ultrasound examination is performed, prior to amniocentesis, to 
determine gestational age, position of the fetus and placenta, and determine if enough amniotic fluid 

25 is present. Within the amniotic fluid are fetal cells (mostly derived from fetal skin) which can be 
grown in culture for chromosomal, biochemical, and molecular biologic analyses. 

Large chromosomal abnormalities, such as extra or missing chromosomes or chromosome 
fragments, can be detected by karyotyping, which involves the identification and analysis of all 46 
chromosomes from a cell and arranges them in their matched pairs, based on subtle differences in 

30 size and structure. In this systematic display, abnormalities in chromosome number and structure 
are apparent. This procedure typically takes 7-10 days for completion. 

While amniocentesis can be used to provide direct genetic information, risks are associated 
with the procedure including fetal loss and maternal Rh sensitization. The increased risk for fetal 
mortality following amniocentesis is about 0.5% above what would normally be expected. Rh 

3 5 negative mothers can be treated with RhoGam. 
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Chorionic Villus Sampling (CVS) 

In this procedure, a catheter is passed via the vagina through the cervix and into the uterus 
to the developing placenta with ultrasound guidance. The introduction of the catheter allows cells 
5 from the placental chorionic villi to be obtained and analyzed by a variety of techniques, including 
chromosome analysis to determine the karyotype of Hie fetus. The cells can also be cultured for 
biochemical or molecular biologic analysis. Topically, CVS is performed between 9.5 and 12.5 
weeks gestation. 

CVS has the disadvantage of being an invasive procedure, and it has a low but significant 
10 rate of morbidity for the fetus; this loss rate is about 0.5 to 1 % higher than for women undergoing 
amniocentesis. Rarely, CVS can be associated with limb defects in the fetus. Also, the possibility 
of maternal Rh sensitization is present. Furthermore, there is also the possibility that maternal 
blood cells in the developing placenta will be sampled instead of fetal cells and confound 
chromosome analysis. 

15 

Maternal Serum AIpha-Fetoprotein (MSAFP) 

The developing fetus has two major blood proteins-albumin and alpha-fetoprotein (AFP). 
The mother typically has only albumin in her blood, and thus, the MSAFP test can be utilized to 
determine the levels of AFP from the fetus. Ordinarily, only a small amount of AFP gains access to 

20 the amniotic fluid and crosses the placenta to mother's blood. However, if the fetus has a neural 
tube defect, then more AFP escapes into the amniotic fluid. Neural tube defects include 
anencephaly (failure of closure at the cranial end of the neural tube) and spina bifida (failure of 
closure at the caudal end of the neural tube). The incidence of such defects is about 1 to 2 births per 
1000 in the United States. Also, if there are defects in the fetal abdominal wall, the AFP from the 

25 fetus will end up in maternal blood in higher amounts. 

The amount of MSAFP increases with gestational age, and thus for the MSAFP test to 
provide accurate results, the gestational age must be known with certainty. Also, the race of the 
mother and presence of gestational diabetes can influence die level of MSAFP that is to be 
considered normal. The MSAFP is typically reported as multiples of the mean (MoM). The greater 

30 the MoM, the more likely a defect is present. The MSAFP test has the greatest sensitivity between 
16 and 18 weeks gestation, but can be used between 15 and 22 weeks gestation. The MSAFP tends 
to be lower when Down's Syndrome or other chromosomal abnormalities is present. 

While the MSAFP test is non-invasive, the MSAFP is not 100% specific. MSAFP can be 
elevated for a variety of reasons that are not related to fetal neural tube or abdominal wall defects. 
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The most common cause for an elevated MSAFP is a wrong estimation of the gestational age of the 
fetus. Therefore, results from an MSAFP test are never considered definitive and conclusive. 

Maternal Serum Beta-HCG 

5 Beginning at about a week following conception and implantation of the developing 

embryo into the uterus, the trophoblast will produce detectable beta-HCG (the beta subunit of 
human chorionic gonadotropin), which can be used to diagnose pregnancy. The beta-HCG also can 
be quantified in maternal serum, and this can be useful early in pregnancy when threatened abortion 
or ectopic pregnancy is suspected, because the amount of beta-HCG will be lower than normal. 
10 In the middle to late second trimester, the beta-HCG can be used in conjunction with the 

MSAFP to screen for chromosomal abnormalities, in particular for Down syndrome. An elevated 
beta-HCG coupled with a decreased MSAFP suggests Down syndrome. High levels of HCG 
suggest trophoblastic disease (molar pregnancy). The absence of a fetus on ultrasonography along 
with an elevated HCG suggests a hydatidiform mole. 

15 

Maternal Serum Estriol 

The amount of estriol in maternal serum is dependent upon a viable fetus, a properly 
functioning placenta, and maternal well-being. Dehydroepiandrosterone (DHEA) is made by the 
fetal adrenal glands, and is metabolized in the placenta to estriol. The estriol enters the maternal 
20 circulation and is excreted by the maternal kidney in urine or by the maternal liver in the bile. 
Normal levels of estriol, measured in the third trimester, will give an indication of general 
well-being of the fetus. If the estriol level drops, then the fetus is threatened and an immediate 
delivery may be necessary. Estriol tends to be lower when Down syndrome is present and when 
there is adrenal hypoplasia with anencephaly. 

25 

The Triple Screen Test 

The triple screen test comprises analysis of maternal serum alpha-feto-protein (MSAFP), 
human chorionic gonadotropin (hCG), and unconjugated estriol (uE3). The blood test is usually 
performed 16-18 weeks after the last menstrual period. While the triple screen test is non-invasive, 
30 abnormal test results are not indicative of a birth defect Rather, the test only indicates an increased 
risk and suggests that further testing is needed. For example, 100 out of 1,000 women will have an 
abnormal result from the triple screen test. However, only 2-3 of the 100 women will have a fetus 
with a birth defect. This high incidence of false positives causes tremendous stress and unnecessary 
anxiety to the expectant mother. 

35 
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Fetal Cells Isolated From Maternal Blood 

The presence of fetal nucleated cells in maternal blood makes it possible to use these cells 
for noninvasive prenatal diagnosis (Walknowska, et al., Lancet 1:1119-1 122, 1969; Lo et al., Lancet 
2:1363-65, 1989; Lo et al., Blood 88:4390-95, 1996). The fetal cells can be sorted and analyzed by 
a variety of techniques to look for particular DNA sequences (Bianchi et al., Am. J. Hum. Genet 
61:822-29, (1997); Bianchi et al, PNAS 93:705-08, (1996)). Fluorescence in-situ hybridization 
(FISH) is one technique that can be applied to identify particular chromosomes of the fetal cells 
recovered from maternal blood and diagnose aneuploid conditions such as trisomies and monosomy 
X. Also, it has been reported that the number of fetal cells in maternal blood increases in aneuploid 
pregnancies. 

The method of FISH uses DNA probes labeled with colored fluorescent tags that allow 
detection of specific chromosomes or genes under a microscope. Using FISH, subtle genetic 
abnormalities that cannot be detected by standard karyotyping are readily identifiable. This 
procedure typically takes 24-48 hours to complete. Additionally, using a panel of multi-colored 
DNA FISH probes, abnormal chromosome copy numbers can be seen. 

While improvements have been made for the isolation and enrichment of fetal cells, it is 
still difficult to get many fetal blood cells. There may not be enough to reliably determine 
anomalies of the fetal karyotype or assay for other abnormalities. Furthermore, most techniques are 
time consuming, require high-inputs of labor, and are difficult to implement for a high throughput 
fashion. 

Fetal DNA From Maternal Blood 

Fetal DNA has been detected and quantitated in maternal plasma and serum (Lo et al., 
Lancet 350:485-487 (1997); Lo et al., Am. J. hum. Genet. 62:768-775 (1998)). Multiple fetal cell 
types occur in the maternal circulation, including fetal granulocytes, lymphocytes, nucleated red 
blood cells, and trophoblast cells (Pertl, and Bianchi, Obstetrics and Gynecology 98: 483-490 
(2001)). Fetal DNA can be detected in the serum at the seventh week of gestation, and increases 
with the term of the pregnancy. The fetal DNA present in the maternal serufn and plasma is 
comparable to the concentration of DNA obtained from fetal cell isolation protocols. 

Circulating fetal DNA has been used to determine the sex of the fetus (Lo et al., Am. J. 
hum. Genet. 62:768-775 (1998)). Also, fetal rhesus D genotype has been detected using fetal DNA. 
However, the diagnostic and clinical applications of circulating fetal DNA is limited to genes that 
are present in the fetus but not in the mother (Pertl and Bianchi, Obstetrics and Gynecology 98: 
483-490 (2001)). Thus, a need still exists for a non-invasive method that can determine the 
sequence of fetal DNA and provide definitive diagnosis of chromosomal abnormalities in a fetus. 
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BRIEF SUMMARY OF THE INVENTION 
The invention is directed to a method for detection of genetic disorders including mutations 
and chromosomal abnormalities. In a preferred embodiment, the present invention is used to detect 
mutations, and chromosomal abnormalities including but not limited to translocation, transversion, 
monosomy, trisomy, and other aneuploidies, deletion, addition, amplification, -fragment, 
translocation, and rearrangement. Numerous abnormalities can be detected simultaneously. The 
present invention also provides a non-invasive method to determine the sequence of fetal DNA 
from a sample of a pregnant female. The present invention can be used to detect any alternation in 
gene sequence as compared to the wild type sequence including but not limited to point mutation, 
reading frame shift, transition, transversion, addition, insertion, deletion, addition-deletion, frame- 
shift, missense, reverse mutation, and microsatellite alteration. The present invention also provides 
a method for isolating free nucleic acid from a sample containing nucleic acid. 

In one embodiment, the present invention is directed to a method for detecting 
chromosomal abnormalities said method comprising: (a) determining the sequence of alleles of a 
locus of interest on template DNA, and (b) quantitating a ratio for the alleles at a heterozygous 
locus of interest that was identified from the locus of interest of (a), wherein said ratio indicates the 
presence or absence of a chromosomal abnormality. 

In another embodiment, the present invention provides a non-invasive method for 
determining the sequence of a locus of interest on fetal DNA, said method comprising: (a) obtaining 
a sample from a pregnant female; (b) adding a cell lysis inhibitor, a cell membrane stabilizer, or a 
cross-linker to the sample of (a); (c) obtaining template DNA from the sample of (b), wherein said 
template DNA comprises fetal DNA and maternal DNA; and (d) determining the sequence of a 
locus of interest on template DNA. 

In another embodiment, the template DNA is obtained from a sample including but not 
limited to a cell, tissue, blood, serum, plasma, saliva, urine, tears, vaginal secretion, sweat, 
umbilical cord blood, chorionic villi, amniotic fluid, embryonic tissue, embryo, a two-celled 
embryo, a four-celled embryo, an eight-celled embryo, a 16-celled embryo, a 32- celled embryo, a 
64-celled embryo, a 128-celled embryo, a 256-ceIled embryo, a 512-celled embryo, a 1024-celled 
embryo, lymph fluid, cerebrospinal fluid, mucosa secretion, peritoneal fluid, ascitic fluid, fecal 
matter, or body exudate. 

In one embodiment, the template DNA is obtained from a sample from a pregnant female. 
In a preferred embodiment, the template DNA is obtained from a pregnant human female. 

In another embodiment, the template DNA is obtained from an embryo. In a preferred 
embodiment, the template DNA is obtained from a single cell from an embryo. 
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In another embodiment, a cell lysis inhibitor is added to the sample including but not 
limited to formaldehyde, and derivatives of formaldehyde, formalin, glutaraldehyde, and derivatives 
of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, sulfhydryl reactive 
crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive crosslinkers, carboxyl 
5 reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, 
BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, 
DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, 
Sulfo-DST, Sulfo-EGS or compounds listed in Table XXEtt. In a preferred embodiment, formalin 
is present in the sample at a percentage including but not limited to 0.0001-0.03%, 0.03-0.05%, 

10 0.05-0.08%, 0.08-0.1%, 0.1-0.3%, 0.3-0.5%, 0.5-0.7%, 0.7-0.9%, 0.9-1.2%, 1.2-1.5%, 1.5-2%, 2- 
3%, 3-5%, and greater than 5%. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 
reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 

15 concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 

20 biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 

PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

In another embodiment, an agent that prevents DNA destruction is added to the sample 
including but not limited to DNase inhibitors, zinc chloride, ethylenediaminetetraacetic acid, 

25 guanidine-HCI, guanidine isothiocyanate, N-lauroylsarcosine, and Na-dodecylsulphate. 

In a preferred embodiment, template DNA is obtained from the plasma of the blood from a 
pregnant female. In another embodiment, the template DNA is obtained from the serum of the 
blood from a pregnant female. 

In another embodiment, template DNA comprises fetal DNA and maternal DNA. 

30 In another embodiment, the locus of interest on the template DNA is selected from a 

maternal homozygous locus of interest. In another embodiment, the locus of interest on the 
template DNA is selected from a maternal heterozygous locus of interest. , 

In another embodiment, the locus of interest on the template DNA is selected from a 
paternal homozygous locus of interest. In another embodiment, the locus of interest on the template 

35 DNA is selected from a paternal heterozygous locus of interest. 
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In one embodiment, the sequence of alleles of multiple loci of interest on a single 
chromosome is determined. In a preferred embodiment, the sequence of alleles of multiple loci of 
interest on multiple chromosomes is determined. 

In another embodiment, determining the sequence of alleles of a locus of interest comprises 
a method including but not limited to allele specific PCR, gel electrophoresis, ELISA, mass 
spectrometry, hybridization, primer extension, fluorescence polarization, fluorescence detection, 
fluorescence resonance energy transfer (FRET), sequencing, DNA microarray, SNP-IT, GeneChips, 
HuSNP, BeadArray, TaqMan assay, Invader assay, MassExtend, MassCleave™ (hMC) method, 
southern blot, slot blot, dot blot, and MALDI-TOF mass spectrometry. 

In a preferred embodiment, determining the sequence of alleles of a locus of interest 
comprises (a) amplifying the locus of interest using a first and second primers, wherein the second 
primer contains a recognition site for a restriction enzyme that generates a 5' overhang containing 
the locus of interest; (b) digesting the amplified DNA with the restriction enzyme that recognizes 
the recognition site on the second primer; (c) incorporating a nucleotide into the digested DNA of 
(b) by using the 5' overhang containing the locus of interest as a template; and (d) determining the 
sequence of the locus of interest by determining the sequence of the DNA of (c). 

In one embodiment, the amplification can comprise polymerase chain reaction (PCR). In a 
further embodiment, the annealing temperature for cycle 1 of PCR can be about the melting 
temperature of the annealing length of the second primer. In another embodiment, the annealing 
temperature for cycle 2 of PCR can be about the melting temperature of the 3* region, which 
anneals to the template DNA, of the first primer. In another embodiment, the annealing 
temperature for the remaining cycles can be about the melting temperature of the entire sequence of 
the second primer. 

In another embodiment, the recognition site on the second primer is for a restriction enzyme 
that cuts at a distance from its binding site and generates a 5* overhang, which contains the locus of 
interest. In a preferred embodiment, the recognition site on the second primer is for a Type IIS 
restriction enzyme. The Type IIS restriction enzyme includes but is not limited to Alw I, Alw26 1, 
Bbs I, Bbv I, BceA I, Bmr I, Bsa I, Bst71 1, BsmA I, BsmB I, BsmF I, BspM I, Ear I, Fau I, Fok I, 
Hga I, Pie I, Sap I, SSfaN I, and Sthi32 1, and more preferably BceA I and BsmF L 

In one embodiment, the 3 ' end of the second primer is adjacent to the locus of interest. 

In another embodiment, the annealing length of the second primer is selected from the 
group consisting of 35-30, 30-25, 25-20, 20-15, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, and less than 
4 bases. 

In another embodiment, amplifying the loci of interest comprises using first and second 
primers that contain a portion of a restriction enzyme recognition site, wherein said recognition site 
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contains at least one variable nucleotide, and after amplification the full restriction enzyme 
recognition site is generated, and the 3* region of said primers can contain mismatches with the 
template DNA, and digestion with said restriction enzyme generates a 5' overhang containing the 
locus of interest 

5 In a preferred embodiment, the recognition site for restriction enzymes including but not 

limited to BsaJ I (5' c'CNNGG 3'), BssK I (5' A CCNGG 3'), Dde I (5'C l TNAG 3'), EcoN I 
(5'CCTNN i NNNAGG 3'), Fnu4H I (5'GC*NGC 3'), Hinf I (5'G A ANTC 3'), PAF 1(5' 
GACN ; NNGTC 3'), Sau96 1 (5' g'GNCC 3 s ), ScrF I (5' CC j NGG 3'), Tthl 1 1 1 (5' 
GACN l NNGTC 3'), and more preferably Fnu4H I and EcoN I, is generated Rafter amplification. 

10 In another embodiment, the 5' region of the first and/or second primer contains a 

recognition site for a restriction enzyme. In a prefeired embodiment, the restriction enzyme 
recognition site is different from the restriction enzyme recognition site that generates a 5' overhang 
containing the locus of interest. 

In a further embodiment, the method of the invention further comprises digesting the DNA 

15 with a restriction enzyme that recognizes the recognition site at the 5' region of the first and/or 
second primer. 

The first and/or second primer can contain a tag at the 5' terminus. Preferably, the first 
primer contains a tag at the 5' terminus. The tag can be used to separate the amplified DNA from 
the template DNA. The tag can be used to separate the amplified DNA containing the labeled 

20 nucleotide from the amplified DNA that does not contain the labeled nucleotide. The tag can be 
any chemical moiety including but not limited to radioisotope, fluorescent reporter molecule, 
chemiluminescent reporter molecule, antibody, antibody fragment, hapten, biotin, derivative of 
biotin, photobiotin, iniinobiotin, digoxigenin, avidin, enzyme, acridinium, sugar, enzyme, 
apoenzyme, homopolymeric oligonucleotide, hormone, ferromagnetic moiety, paramagnetic 

25 moiety, diamagnetic moiety, phosphorescent moiety, luminescent moiety, electrochemiluminescent 
moiety, chromatic moiety, moiety having a detectable electron spin resonance, electrical 
capacitance, dielectric constant or electrical conductivity, or combinations thereof. Preferably, the 
tag is biotin. The biotin tag is used to separate amplified DNA from the template DNA using a 
streptavidin matrix. The streptavidin matrix is coated on wells of a microtiter plate. 

30 The incorporation of a nucleotide in the method of the invention is by a DNA polymerase 

including but not limited to E. coli DNA polymerase, Klenow fragment of E. coli DNA polymerase 
I, T7 DNA polymerase, T4 DNA polymerase, T5 DNA polymerase, Klenow class polymerases, 
Taq polymerase, Pfii DNA polymerase, Vent polymerase, bacteriophage 29, REDTaq™ Genomic 
DNA polymerase, or sequenase. The incorporation of a nucleotide can further comprise using a 

35 mixture of labeled and unlabeled nucleotides. One nucleotide, two nucleotides, three nucleotides, 
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four nucleotides, five nucleotides, or more than five nucleotides can be incorporated. A 
combination of labeled and unlabeled nucleotides can be incorporated. The labeled nucleotide is 
selected from the group consisting of a dideoxynucleotide triphosphate (also referred to as 
"dideoxy") and deoxynucleotide triphosphate (also referred to as "deoxy"). The unlabeled 

5 nucleotide is selected from the group consisting of a dideoxynucleotide triphosphate and 

deoxynucleotide triphosphate. The labeled nucleotide is labeled with a molecule including but not 
limited to radioactive molecule, fluorescent molecule, antibody, antibody fragment, hapten, 
carbohydrate, biotin, and derivative of biotin, phosphorescent moiety, luminescent moiety, 
electrochemiluminescent moiety, chromatic moiety, and moiety, having a detectable electron spin 

10 resonance, electrical capacitance, dielectric constant or electrical conductivity. Preferably, the 
labeled nucleotide is labeled with a fluorescent molecule. The incorporation of a fluorescent 
labeled nucleotide further comprises using a mixture of fluorescent and unlabeled nucleotides. 

In one embodiment, the determination of the sequence of the locus of interest comprises 
detecting the incorporated nucleotide. In one embodiment, the detection is by a method selected 

1 5 from the group consisting of gel electrophoresis, capillary electrophoresis, microchannel 
electrophoresis, polyacrylamide gel electrophoresis, fluorescence detection, fluorescence 
polarization, DNA sequencing, Sanger dideoxy sequencing, ELISA, mass spectrometry, time of 
flight mass spectrometry, quadrupole mass spectrometry, magnetic sector mass spectrometry, 
electric sector mass spectrometry, fluorometry, infrared spectrometry, ultraviolet spectrometry, 

20 palentiostatic amperometry, DNA hybridization, DNA microarray, GeneChip arrays, HuSNP 
arrays, BeadArrays, MassExtend, SNP-IT, TaqMan assay, Invader assay, MassCleave, southern 
blot, slot blot, and dot blot. 

In one embodiment, the sequence of alleles of one to tens to hundreds to thousands of loci 
of interest on a single chromosome on template DNA is determined. In a preferred embodiment, 

25 the sequence of alleles of one to tens to hundreds to thousands of loci of interest on multiple 
chromosomes is determined. 

In a preferred embodiment, the locus of interest is suspected of containing a single 
nucleotide polymorphism or mutation. The method can be used for determining sequences of 
multiple loci of interest concurrently. The template DNA can comprise multiple loci from a single 

30 chromosome. The template DNA can comprise multiple loci from different chromosomes. The 

loci of interest on template DNA can be amplified in one reaction. Alternatively, each of the loci of 
interest on template DNA can be amplified in a separate reaction. The amplified DNA can be 
pooled together prior to digestion of the amplified DNA. Each of the labeled DNA containing a 
locus of interest can be separated prior to determining the sequence of the locus of interest. In one 
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embodiment, at least one of the loci of interest is suspected of containing a single nucleotide 
polymorphism or a mutation. 

In another embodiment, the ratio of alleles at a heterozygous locus of interest on a 
chromosome is compared to the ratio of alleles at a heterozygous locus of interest on a different 
5 chromosome. There is no limitation as to the chromosomes that can be compared. The ratio for the 
alleles at a heterozygous locus of interest on any chromosome can be compared to the ratio for the 
alleles at a heterozygous locus of interest on any other chromosome. In a preferred embodiment, 
the ratio of alleles at multiple heterozygous loci of interest on a chromosome are summed and 
compared to the ratio of alleles at multiple heterozygous loci of interest on a different chromosome. 

10 In another embodiment, the ratio of alleles at a heterozygous locus of interest on a 

chromosome is compared to the ratio of alleles at a heterozygous locus of interest on two, three, 
four or more than four chromosomes. In another embodiment, the ratio of alleles at multiple loci of 
interest on a chromosome is compared to the ratio of alleles at multiple loci of interest on two, 
three, four, or more than four chromosomes. 

15 In another embodiment, the ratio of the alleles at a locus of interest on a chromosome is 

compared to the ratio of the alleles at a locus of interest on a different chromosome, wherein a 
difference in the ratios indicates the presence or absence of a chromosomal abnormality. In another 
embodiment, the ratio of the alleles at multiple loci of interest on a chromosome is compared to the 
ratio of the alleles at multiple loci of interest on a different chromosome, wherein a difference in the 

20 ratios indicates the presence or absence of a chromosomal abnormality. 

In another embodiment, the sequence of one to tens to hundreds to thousands of loci of 

interest on the template DNA obtained from a sample of a pregnant female is determined. In one 

embodiment, the loci of interest are on one chromosome. In another embodiment, the loci of 

interest are on multiple chromosomes. 

25 In another embodiment, the present invention provides a method for isolating nucleic acid 

♦ 

said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a cell lysis 
inhibitor, cell membrane stabilizer, or cross-linker to the sample of (a); and (c) isolating nucleic 
acid. In a preferred embodiment, the method is used for isolating free nucleic acid. In a most 
preferred embodiment, the method is used for isolating free fetal nucleic acid. 
30 In another embodiment, the present invention provides a method for detecting nucleic acid, 

wherein said method comprises: (a) isolating nucleic acid from a nucleic acid containg sample, 
wherein a cell lysis inhibitor, cell membrane stabilizer, or cross-linker was added to the sample; and 
(b) detecting the presence or absence of the nucleic acid. In a preferred embodiment, the nucleic 
acid isolated is free nucleic acid. 
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In another embodiment, the present invention provides a method for detecting nucleic acid 
containing at least one mutation, wherein said method comprises: (a) isolating nucleic acid from a 
nucleic acid containg sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross-linker 
was added to the sample; and (b) detecting the presence or absence of nucleic acid containing the 
5 mutation. In a preferred embodiment, the nucleic acid isolated is free nucleic acid. 

In another embodiment, the present invention provides a method fondetecting nucleic acid 
containing at least one mutation, wherein said method comprises: (a) isolating nucleic acid from a 
blood sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross-linker was added to 
the sample; and (b) detecting the presence or absence of a nucleic acid molecule containing the 
10 mutation. In a preferred embodiment, the nucleic acid isolated is free nucleic acid. 

In another embodiment, the present invention provides a method for detecting a nucleic 
acid molecule containing at least one mutation, wherein said method comprises: (a) isolating 
nucleic acid from a plasma sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross- 
linker was added to a blood sample prior to isolation of the plasma; and (b) detecting the presence 
1 5 or absence of a nucleic acid molecule containing the mutation. In a preferred embodiment, the 
nucleic acid isolated is free nucleic acid. 

In another embodiment, the present invention provides a method for detecting a nucleic 
acid, wherein said method comprises: (a) isolating nucleic acid from a nucleic acid containing 
sample, wherein an agent that impedes cell lysis was added to the sample; and (b) detecting the 
20 presence or absence of the nucleic acid. 

In another embodiment, the present invention provides a method for isolating a nucleic 
acid, wherein said method comprises: (a) isolating nucleic acid from a nucleic acid containing 
sample, wherein an agent that impedes cell lysis was added to the sample. 

In another embodiment, the present invention provides a method for isolating nucleic acid 
25 said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a cell lysis 
inhibitor, cell membrane stabilizer, or cross-linker to the sample of (a); and (c) isolating nucleic 
acid. In a preferred embodiment, the method is used for isolating free nucleic acid. In a most 
preferred embodiment, the method is used for isolating free fetal nucleic acid. 

In another embodiment, the presnt invention provides a method for isolating free fetal 
30 nucleic acid said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a 
cell lysis inhibitor, cell membrane stabilizer, or cross-linker to the sample of (a); (c) isolating the 
plasma from the blood sample, wherein the plasma is isolated by centrifuging the blood sample; and 
(d) removing the supernatant, which contains the plasma, using procedures to minimize disruption 
of the "buflfr-coat." 
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In another embodiment, the sample containing the nucleic acid is obtained from any nucleic 
acid containg source including but not limited to a cell, tissue, blood, serum, plasma, saliva, urine, 
tears, breast fluid, breast milk, vaginal secretion, sweat, umbilical cord blood, chorionic villi, 
amniotic fluid, embryonic tissue, embryo, a two-celled embryo, a four-celled embryo, an 
5 eight-celled embryo, a 1 6-celIed embryo, a 32- celled embryo, a 64-celled embryo, a 128-celled 
embryo, a 256-celled embryo, a 512-celled embryo, a 1024-celled embryo, lymph fluid, 
cerebrospinal fluid, mucosa secretion, peritoneal fluid, ascitic fluid, fecal matter, or body exudate. 

In one embodiment, the sample containing the nucleic acid is obtained from a pregnant 
female. In a preferred embodiment, the sample is obtained from a pregnant human female. In a 
10 preferred embodiment, the sample is blood obtained from a pregnant female. 

f ! 

In another embodiment, a cell lysis inhibitor, a cell membrane stabilizer or a cross-linker is 
added to the sample including but not limited to formaldehyde, and derivatives of formaldehyde, 
formalin, glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive 
crosslinkers, sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, 
15 carbohydrate reactive crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, 
cleavable crosslinkers, AEDP, APG, BASED, BM(PEO)3, BM(PEO) 4j BMB, BMDB, BMH, 
BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, 
DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in 
Table XXm. 

20 An agent that stabilizes cell membranes may be added to the sample including but not 

limited to aldehydes, urea formaldehyde, phenol formaldehyde, DMAE (dimethylaminoethanol), 
cholesterol, cholesterol derivatives, high concentrations of magnesium, vitamin E, and vitamin E 
derivatives, calcium, calcium gluconate, taurine, niacin, hydroxylamine derivatives, bimoclomol, 
sucrose, astaxanthin, glucose, amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane 

25 tetral phenylacetate, citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, 
sorbitol, calcium, coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, 
• zonegran, zinc, ginkgo biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, 
phosphatidylserine, tegretol, PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc 
citrate, mexitil, dilantin, sodium hyaluronate, or polaxamer 188. 

30 In another embodiment, an agent that prevents DNA destruction is added to the sample 

including but not limited to DNase inhibitors, zinc chloride, ethylenediaminetetraacetic acid, 
guanidine-HCI, guanidine isothiocyanate, N-lauroylsarcosine, and Na-dodecylsulphate. 

In a preferred embodiment, the nucleic acid is isolated from plasma obtained from blood of 
a pregnant female. In another embodiment, the nucleic acid is isolated from plasma, which is 

35 generated using procedures designed to minimize the amount of maternal cell lysis. In another 
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embodiment, the nucleic acid is isolated from plasma by centrifuging blood from a pregnant female, 
and allowing the centrifuge to stop without applying a brake. 

In a preferred embodiment, free nucleic acid is isolated from plasma obtained from blood 
from a pregnant female. In another embodiment, the free nucleic acid is isolated from plasma, 
which is generated by centrifiiging blood obtained from a pregnant female, and allowing the 
centrifuge to stop without applying a brake (the centrifuge comes to a stop by natural deceleration). 
In another embodiment, the blood from a pregnant female is centrifiiged at a speed including but 
not limited to 0-50 rpm, 50-100 rpm, 100-200 rpm, 200-300 rpm, 300-400 rpm, 400-500 rpm, 500- 
600 rpm, 600-700 rpm, 700-800 rpm, 800-900 rpm, 900-1000 rpm, 1000-2000 rpm, 2000-3000 
rpm, 3000-4000 rpm, 4000-5000 rpm, 5000-6000 ipm, 6000-7000 rpm, 7000-8000 rpm, and greater 
than 8000 rpm. In a preferred embodiment, the blood from the pregnant female is centrifuged at a 
speed less than 4000 rpm. Li another embodiment, the acceleration power of the centrifuge is not 
used. 



BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1A. A Schematic diagram depicting a double stranded DNA molecule. A pair of 
primers, depicted as bent arrows, flank the locus of interest, depicted as a triangle symbol at base 
N14. The locus of interest can be a single nucleotide polymorphism, point mutation, insertion, 
deletion, translocation, etc. Each primer contains a restriction en2yme recognition site about 10 by 
from the 5' terminus depicted as region "a" in the first primer and as region "d" in the second 
primer. Restriction recognition site "a" can be for any type of restriction enzyme but recognition 
site "d" is for a restriction enzyme, which cuts "n" nucleotides away from its recognition site and 
leaves a 5' overhang and a recessed 3' end. Examples of such enzymes include but are not limited 
to BceAI and BsmF I. The 5' overhang serves as a template for incorporation of a nucleotide into 
the 3 9 recessed end. 

The first primer is shown modified with biotin at the 5' end to aid in purification. The 
sequence of the 3* end of the primers is such that the primers anneal at a desired distance upstream 
and downstream of the locus of interest. The second primer anneals close to the locus of interest; 
the annealing site, which is depicted as region "c," is designed such that the 3' end of the second 
primer anneals one base away from the locus of interest. The second primer can anneal any 
distance from the locus of interest provided that digestion with the restriction enzyme, which 
recognizes the region "d" on this primer, generates a 5' overhang that contains the locus of interest 
The first primer annealing site, which is depicted as region "b," is about 20 bases. 
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FIG. IB. A schematic diagram depicting the annealing and extension steps of the first 
cycle of amplification by PCR. The first cycle of amplification is performed at about the melting 
temperature of the 3 J region, which anneals to the template DNA, of the second primer, depicted as 
region "c," and is 13 base pairs in this example. At this temperature, both the first and second 
5 primers anneal to their respective complementary strands and begin extension, depicted by dotted 
lines. In this first cycle, the second primer extends and copies the region b where the first primer 
can anneal in the next cycle. 

FIG. 1C. A schematic diagram depicting the annealing and extension steps following 
10 denaturation in the second cycle of amplification of PCR. The second cycle of amplification is 
performed at a higher annealing temperature (TM2), which is about the melting temperature of the 
20 by of the 3* region of the first primer that anneals to the template DNA, depicted as region "b." 
Therefore at TM2, the first primer, which contains region b' which is complementary to region b, 
can bind to the DNA that was copied in the first cycle of the reaction. However, at TM2 the second 
1 5 primer cannot anneal to the original template DNA or to DNA that was copifed in the first cycle of 
die reaction because the annealing temperature is too high. The second primer can anneal to 13 
bases in the original template DNA but TM2 is calculated at about the melting temperature of 20 
bases. 

20 ' FIG. ID. A schematic diagram depicting the annealing and extension reactions after 

denaturation during the third cycle of amplification. In this cycle, the annealing temperature, TM3, 
is about the melting temperature of the entire second primer, including regions "c" and "d. M The 
length of regions V + "d" is about 27-33 by long, and thus TM3 is significantly higher than TM1 
and TM2. At this higher TM the second primer, which contain regions c' and d', anneals to the 

25 copied DNA generated in cycle 2. 

FIG. IE. A schematic diagram depicting the annealing and extension reactions for the 
remaining cycles of amplification. The annealing temperature for the remaining cycles is TM3, 
which is about the melting temperature of the entire second primer. At TM3, the second primer 
30 binds to templates that contain regions c* and d' and the first primer binds to templates that contain 
regions a* and b. By raising the annealing temperature successively in each cycle for the first three 
cycles, from TM1, TM2, and TM3, nonspecific amplification is significantly reduced. 

FIG. IF. A schematic diagram depicting the amplified locus of interest bound to a solid 

35 matrix. 

j 

16 



WO 2004/078994 PCT/US2004/006337 



FIG. 1G. A schematic diagram depicting the bound, amplified DNA after digestion with 
restriction enzyme "d." The "downstream" end is released into the supernatant, and can be 
removed by washing with any suitable buffer. The upstream end containing the locus of interest 
5 remains bound to the solid matrix. 

FIG. 1H. A schematic diagram depicting the bound amplified DNA, after "filling in" with 
a labeled ddNTP. A DNA polymerase is used to "fill in" the base (N* 14) that is complementary to 
the locus of interest (N14). In this example, only ddNTPs are present in this reaction, such that 
1 0 only the locus of interest or SNP of interest is filled in. 

FIG. II. A schematic diagram depicting the labeled, bound DNA after digestion with 
restriction enzyme "a." The labeled DNA is released into the supernatant, which can be collected to 
identify the base that was incorporated. 

15 

FIG. 2. A schematic diagram depicting double stranded DNA templates with n number of 
loci of interest and n number of primer pairs, Xi, yi to Xn, y m specifically annealed such that a primer 
flanks each locus of interest. The first primers are biotinylated at the 5' end, depicted by », and 
contain a restriction enzyme recognition site, "a", which can be any type of restriction enzyme. The 

20 second primers contain a restriction enzyme recognition site, "d," where "d" is a recognition site for 
a restriction enzyme that cuts "n" nucleotides away from its recognition site, and generates a 5' 
overhang containing the locus of interest and a recessed 3' end. The second primers anneal 
adjacent to the respective loci of interest. The exact position of the restriction enzyme site "d" in 
the second primers is designed such that digesting the PCR product of each locus of interest with 

25 restriction enzyme "d" generates a 5' overhang containing the locus of interest and a 3' recessed 
end. The annealing sites of the first primers are about 20 bases long and are selected such that each 
successive first primer is further away from its respective second primer. For example, if at locus 1 
the Y ends of the first and second primers are Z base pairs apart, then at locus 2, the 3 * ends of the 
first and second primers are Z + K base pairs apart, where K = 1, 2, 3 or more than three bases. 

30 Primers for locus N are Zn-i + K base pairs apart. The purpose of making each successive first 
primer further apart from their respective second primers is such that the "filled in" restriction 
fragments (generated after amplification, purification, digestion and labeling as described in FIGS. 
IB- II) differ in size and can be resolved, for example by electrophoresis, to allow detection of each 
individual locus of interest. 

35 

I 
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FIG. 3. PCR amplification of SNPs using multiple annealing temperatures. A sample 
containing genomic DNA templates from thirty-six human volunteers was analyzed for the 
following four SNPs: SNP HC21S00340 (lane 1), identification number as assigned in the Human 
Chromosome 21 cSNP Database, located on chromosome 21; SNP TSC 00955 12 (lane 2), located 
5 on chromosome 1, SNP TSC 0214366 (lane 3), located on chromosome 1; and SNP TSC 00873 15 
(lane 4), located on chromosome 1. Each SNP was amplified by PCR using three different 
annealing temperature protocols, herein referred to as the low stringency annealing temperature; 
medium stringency annealing temperature; and high stringency annealing temperature. Regardless 
of the annealing temperature protocol, each SNP was amplified for 40 cycles of PCR. The 
10 denaturation step for each PCR reaction was performed for 30 seconds at 95°C. 

FIG. 3 A. Photograph of a gel demonstrating PCR amplification of the 4 different SNPs 
using the low stringency annealing temperature protocol. 

15 FIG. 3B. Photograph of a gel demonstrating PCR amplification of the 4 different SNPs 

using medium stringency annealing temperature protocol. 

FIG. 3C. Photograph of a gel demonstrating PCR amplification of the 4 different SNPs 
using the high stringency annealing temperature protocol. 

20 

FIG. 4A. A depiction of the DNA sequence of SNP HC21 S00027, as assigned by the 
Human Chromosome 21 cSNP database, located on chromosome 21. A first primer and a second 
primer are indicated above and below, respectively, the sequence of HC21S00027. The first primer 
is biotinylated and contains the restriction enzyme recognition site for EcoRI. The second primer 
25 contains the restriction enzyme recognition site for BsmF I and contains 13 bases that anneal to the 
DNA sequence. The SNP is indicated by R (A/G) and r (T/C) (complementary to R). 

FIG. 4B. A depiction of the DNA sequence of SNP HC21 S00027, as assigned by the 
Human Chromosome 21 cSNP database, located on chromosome 21. A first primer and a second 
30 primer are indicated above and below, respectively, the sequence of HC21S00027. The first primer 
is biotinylated and contains the restriction enzyme recognition site for EcoRI. The second primer 
contains the restriction enzyme recognition site for BceA I and has 13 bases that anneal to the DNA 
sequence. The SNP is indicated by R (A/G) and r (T/C) (complementary to R). 

FIG. 4C. A depiction of the DNA sequence of SNP TSC0095512 from chromosome 1. 
35 The first primer and the second primer are indicated above and below, respectively, the sequence of 
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TSC0095512. The first primer is biotinylated and contains the restriction enzyme recognition site 
for EcoRI. The second primer contains the restriction enzyme recognition site for BsmF I and has 
13 bases that anneal to the DNA sequence. The SNP is indicated by S (G/C) and s (C/G) 
(complementary to S). 

5 

FIG. 4D. A depiction of the DNA sequence of SNP TSC00955 1 2 from chromosome 1 . 
The first primer and the second primer are indicated above and below, respectively, the sequence of 
TSC0095512. The first primer is biotinylated and contains the restriction enzyme recognition site 
for EcoRI. The second primer contains the restriction enzyme recognition site for BceA I and has 
10 13 bases that anneal to the DNA sequence. The SNP is indicated by S (G/C) and s (C/G) 
(complementary to S). 

FIGS. 5A-5D. A schematic diagram depicting the nucleotide sequences of SNP 
HC21S00027 (FIGS. 5Aand5B)andSNPTSC0095512(FlGS. 5C and 5D) after amplification 
15 with the primers described in FIGS. 4A-4D. Restriction sites in the primer sequence are indicated 
in bold. 

FIGS. 6A-6D. A schematic diagram depicting the nucleotide sequences of each amplified 
SNP after digestion with the appropriate Type IIS restriction enryme. FIGS. 6A and 6B depict 
20 fragments of SNP HC21 S00027 digested with the Type IIS restriction enzymes BsmF I and BceA I, 
respectively. FIGS. 6C and 6D depict fragments of SNP TSC00955 12 digested with the Type US 
restriction enzymes BsmF I and BceA I, respectively. 

FIGS. 7A-7D. A schematic diagram depicting the incorporation of a fluorescently labeled 
25 nucleotide using the 5 * overhang of the digested SNP site as a template to "fill in" the 3 * recessed 
end. FIGS. 7A and 7B depict the digested SNP HC21S00027 locus with an incorporated labeled 
ddNTP (*R' dd = fluorescent dideoxy nucleotide). FIGS. 7C and 7D depict the digested SNP 
TSC0095512 locus with an incorporated labeled ddNTP (*S" dd = fluorescent dideoxy nucleotide). 
The use of ddNTPs ensures that the 3' recessed end is extended by one nucleotide, which is 
30 complementary to the nucleotide of interest or SNP site present in the 5' overhang. 

FIG. 7E. A schematic diag-am depicting the incorporation of dNTPs and a ddNTP into the 
5' overhang containing the SNP site. SNP HC21S00007 was digested with BsmF I, which 
generates a four base 5* overhang. The use of a mixture of dNTPs and ddNTPs allows the 3' 
35 recessed end to be extended one nucleotide (a ddNTP is incorporated first); two nucleotides (a 
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dNTP is incorporated followed by a ddNTP); three nucleotides (two dNTPs are incorporated, 
followed by a ddNTP); or four nucleotides (three dNTPs are incorporated, followed by a ddNTP). 
All four products can be separated by size, and the incorporated nucleotide detected (*R" dd = 
fluorescent dideoxy nucleotide). Detection of the first nucleotide, which corresponds to the SNP or 
5 locus site, and the next three nucleotides provides an additional level of quality assurance. The 
SNP is indicated by R (A/G) and r (T/C) (complementary to R). 

FIGS. 8A-8D. Release of the "filled in" SNP from the solid support matrix, i.e. 
streptavidin coated well. SNP HC21S00027 is shown in FIGS. 8A and 8B', while SNP 
10 TSC00955 12 is shown in FIGS. 8C and 8D. The 'Tilled in" SNP is free in solution, and can be 
detected. 

FIG. 9A. Sequence analysis of SNP HC21S00027 digested with BceAI. Four "fill in" 
reactions are shown; each reaction contained one fluorescently labeled nucleotide, ddGTP, ddATP, 
1 5 ddTTP, or ddCTP, and unlabeled ddNTPs. The 5 9 overhang generated by digestion with BceA I 
and the expected nucleotides at this SNP site are indicated. 

FIG. 9B. Sequence analysis of SNP TSC00955 12. SNPTSC0095512 was amplified with 
a second primer that contained the recognition site for BceA I, and in a separate reaction, with a 
20 second primer that contained the recognition site for BsmF I. Four fill in reactions are shown for 
each PGR product; each reaction contained one fluorescently labeled nucleotide, ddGTP, ddATP, ' 
ddTTP, or ddCTP, and unlabeled ddNTPs. The 5' overhang generated by digestion with BceA I 
and with BsmF I and the expected nucleotides are indicated. 

25 FIG. 9C. Sequence analysis of SNP TSC0264580 after amplification with a second primer 

that contained the recognition site for BsmF I. Four fill in reactions are shown; each reaction 
contained one fluorescently labeled nucleotide, which was ddGTP, ddATP, ddTTP, or ddCTP and 
unlabeled ddNTPs. Two different 5' overhangs are depicted: one represents the DNA molecules 
that were cut 1 1 nucleotides away on the sense strand and 1 5 nucleotides away on the antisense 

30 strand and the other represents the DNA molecules that were cut 10 nucleotides away on the sense 
strand and 14 nucleotides away on the antisense strand. The expected nucleotides also are 
indicated. 

FIG. 9D. Sequence analysis of SNP HC21S00027 amplified with a second primer that 
35 contained the recognition site for BsmF I. A mixture of labeled ddNTPs and unlabeled dNTPs was 
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used to fill in the 5* overhang generated by digestion with BsmF I. Two different 5' overhangs are 
depicted: one represents the DNA molecules that were cut 1 1 nucleotides away on the sense strand 
and 15 nucleotides away on the antisense strand and the other represents the DNA molecules that 
were cut 10 nucleotides away on the sense strand and 14 nucleotides away on the antisense strand. 
5 The nucleotide upstream from the SNP, the nucleotide at the SNP site (the sample contained DNA 
templates from 36 individuals; both nucleotides would be expected to be represented in the sample), 
and the three nucleotides downstream of the SNP are indicated. 

FIG. 10. Sequence analysis of multiple SNPs. SNPsHC21S00131, andHC21S00027, 
10 which are located on chromosome 21, and SNPs TSC00873 15, SNP TSC0214366, SNP 

TSC0413944, and SNP TSC0095512, which are on chromosome 1, were amplified in separate PCR 
reactions with second primers that contained a recognition site for BsmF I. The primers were 
designed so that each amplified locus of interest was of a different size. After amplification, the 
reactions were pooled into a single sample, and all subsequent steps of the method performed (as 
15 described for FIGS. 1F-1I) on that sample. Each SNP and the nucleotide found at each SNP are 
indicated. 

FIG. 11. Quantification of the percentage of fetal DNA in maternal blood. Blood was 
obtained from a pregnant human female with informed consent. DNA was isolated and serial 
20 dilutions were made to determine the percentage of fetal DNA present in the sample. The SRY 
gene, which is located on chromosome Y, was used to detect fetal DNA. The cystic fibrosis gene, 
which is located on chromosome 7, was used to detect both maternal and fetal DNA. 

FIG. 1 1 A. Amplification of the SRY gene and the cystic fibrosis gene using a DNA 
25 template isolated from a blood sample that was treated with EDTA. 

FIG. 1 IB. Amplification of the SRY gene and the cystic fibrosis gene using a DNA 
template that was isolated from a blood sample that was treated with formalin and EDTA. 

30 FIG. 12. Genetic analysis of an individual previously genotyped with Trisomy 21 (Down's 

Syndrome). Blood was collected, with informed consent, from an individual who had previously 
been genotyped with trisomy 21. DNA was isolated and two SNPs on chromosome 21 and two 
SNPs on chromosome 13 were genotyped. As shown in the photograph of the gel, the SNPs at 
chromosome 21 show disproportionate ratios of the two nucleotides. Visual inspection of the gel 

35 demonstrates that one nucleotide of the two nucleotides at the SNP sites analyzed for chromosome 
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21 is of greater intensity, suggesting it is not present in a 50:50 ratio. However, visual inspection of 
the gel suggests that the nucleotides at the heterozygous SNP sites analyzed on chromosome 13 are 
present in the expected 50:50 ratio. 

5 FIG. 13. Sequence determination of both alleles of SNPs TSC0837969, TSC0034767, 

TSC1 130902, TSC0597888, TSC0195492, TSC06071 85 using one fluorescently labeled ■ 
nucleotide. Labeled ddGTP was used in the presence of unlabeled dATP, dCTP, dTTP to fill-in the 
overhang generated by digestion with BsmF I. The nucleotide preceding the variable site on the 
strand that was filled-in was not guanine, and the nucleotide after the variable site on the strand that 

10 was filled in was not.guanine. The nucleotide two bases after the variable site on the strand that 
was filled-in was guanine. Alleles that contain guanine at variable site are filled in with labeled 
ddGTP. Alleles that do not contain guanine are filled in with unlabeled dATP, dCTP, or dTTP, and 
the polymerase continues to incorporate nucleotides until labeled ddGTP is filled in at position 3 
complementary to the overhang. 

15 

FIG. 14. Identification of SNPs with alleles that are variable within the population. The 
sequences of both alleles of seven SNPs located on chromosome 13 were determined using a 
template DNA comprised of DNA obtained from two hundred and forty five individuals. Labeled 
ddGTP was used in the presence of unlabeled dATP, dCTP, dTTP to fill-in the overhang generated 

20 by digestion with BsmF I. The nucleotide preceding the variable site on the strand that was filled-in 
was not guanine, and the nucleotide after the variable site on the strand that was filled in was not 
guanine. The nucleotide two bases after the variable site on the strand that was filled-in was 
guanine. Alleles that contain guanine at variable site are filled in with labeled ddGTP. Alleles that 
do not contain guanine are filled in with unlabeled dATP, dCTP, or dTTP, and the polymerase 

25 continues to incorporate nucleotides until labeled ddGTP is filled in at position 3 complementary to 
the overhang. 

FIG. 15. Determination of the ratio for one allele to the other allele at heterozygous SNPs. 
The observed nucleotides for SNP TSC0607185 are cytosine (referred to as allele 1) and thymidine 
30 (referred to as allele 2) on the sense strand. The ratio of allele 2 to allele 1 was calculated using 

template DNA isolated from five individuals. The ratio of allele 2 to allele 1 (allele 2 / allele 1) was 
consistently 1:1. 

The observed nucleotides for SNP TSC1 130902 are guanine (referred to as allele 1) and 
adenine (referred to as allele 2) on the sense strand. The ratio of allele 2 to allele 1 was calculated 
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using template DNA isolated from five individuals. The ratio of allele 2 to allele 1 (allele 2 / 
allele 1) was consistently 75:25. 

FIG. 16. The percentage of allele 2 to allele 1 at SNP TSC0108992>remains linear when 
5 calculated on template DNA containing an extra copy of chromosome 2 1 . SNP TSCO 1 08992 was 
amplified using template DNA from four individuals, and two separate fill-in reactions (labeled as 
A and B) were performed for each PCR reaction (labeled 1 through 4). The calculated percentage 
of allele 2 to allele 1 on template DNA from normal individuals was 0.47. The deviation from the 
theoretically predicted percentage of 0.50 remained linear on template DNA isolated from an 
1 0 individual with Down's syndrome. 

FIG. 17 A. Analysis of a SNP located on chromosome 21 from template DNA isolated 
from an individual with a normal genetic karyotype. SNP TSC0108992 was amplified using the 
methods described herein, and after digestion with the type IIS restriction emyme BsmF I, the 5* 
15 overhang was filled in using labeled ddTTP, and unlabeled dATP, dCTP, and dGTP. Three 

separate PCR reactions were performed, and each PCR reaction was split into two samples. The 
percentage of allele 2 at the SNP site (allele 2 / (allele 2 + allele 1)) was calculated, which resulted 
in mean of 0.50. 

20 FIG 17B. Analysis of a SNP located on chromosome 21 from template DNA isolated from 

an individual with a trisomy 21 genetic karyotype. SNP TSC0108992 was amplified using the 
methods described herein, and after digestion with the type IIS restriction enzyme BsmF I, the 5' 
overhang was filled in using labeled ddTTP, and unlabeled dATP, dCTP, and dGTP. Three 
separate PCR reactions were performed, and each PCR reaction was split into two samples. The 

25 percentage of allele 2 at the SNP site (allele 2 / (allele 2 + allele 1)) was calculated, which resulted 
in mean of 0.30. 

FIG. 17C. Analysis of a SNP located on chromosome 21 from a mixture comprised of 
template DNA from an individual with Trisomy 21, and template DNA from an individual with a 

30 normal genetic karyotype in a ratio of 3:1 (Trisomy 21 : Normal). SNP TSC0108992 was amplified 
from the mixture of template DNA using the methods described herein, and after digestion with the 
type IIS restriction enzyme BsmF I, the 5' overhang was filled in using labeled ddTTP, and 
unlabeled dATP, dCTP, and dGTP. Three separate PCR reactions were performed, and each PCR 
reaction was split into two samples. The percentage of allele 2 at the SNP site (allele 2 / (allele 2 + 

35 allele 1)) was calculated, which resulted in mean of 0.3 19. 
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FIG. 1 7D. Analysis of a SNP located on chromosome 21 from a mixture comprised of 
template DNA from an individual with Trisomy 21, and template DNA from an individual with a 
normal genetic karyotype in a ratio of 1 : 1 (Trisomy 2 1 : Normal). SNP TSCO 108992 was amplified 
5 from the mixture of template DNA using the methods described herein, and after digestion with the 
type IIS restriction enzyme BsmF I, the 5' overhang was filled in using labeled ddTTP, and 
unlabeled dATP, dCTP, and dGTP. Three separate PCR reactions were performed, and each PCR 
reaction was split into two samples. The percentage of allele 2 at the SNP site (allele 2 / (allele 2 + 
allele 1)) was calculated, which resulted in mean of 0.352. 

10 

FIG. 17E. Analysis of a SNP located on chromosome 21 from a mixture comprised of 
template DNA from an individual with Trisomy 21, and template DNA from an individual with a 
normal genetic karyotype in a ratio of 1:2.3 (Trisomy 21 : Normal). SNP TSC0108992 was 
amplified from the mixture of template DNA using the methods described herein, and after 
15 digestion with the type IIS restriction enzyme BsmF I, the 5' overhang was filled in using labeled 
ddTTP, and unlabeled dATP, dCTP, and dGTP. Three separate PCR reactions were performed, and 
each PCR reaction was split into two samples. The percentage of allele 2 at the SNP site (allele 2 / 
(allele 2 + allele 1)) was calculated, which resulted in mean of 0.382. 

20 FIG. 17F. Analysis of a SNP located on chromosome 21 from a mixture comprised of 

template DNA from an individual with Trisomy 21, and template DNA from an individual with a 
normal genetic karyotype in a ratio of 1:4 (Trisomy 21: Normal). SNP TSC0108992 was amplified 
from the mixture of template DNA using the methods described herein, and after digestion with the 
type IIS restriction enzyme BsmF I, the 5' overhang was filled in using labeled ddTTP, and 

25 unlabeled dATP, dCTP, and dGTP. Three separate PCR reactions were performed, and each PCR 
reaction was split into two samples. The percentage of allele 2 at the SNP site (allele 2 / (allele 2 + 
allele 1)) was calculated, which resulted in mean of 0.397. 

FIG. 18A. Agarose gel analysis of nine (9) SNPs amplified from template DNA. Each of 
30 the nine SNPs were amplified from genomic DNA using the methods described herein. Lane 1 
corresponds .to SNP TSC0397235, lane 2 corresponds to TSC0470003, lane 3 corresponds to 
TSC1649726, lane 4 corresponds to TSC1261039, lane 5 corresponds to TSC03 10507, lane 6 
corresponds to TSC1650432, lane 7 corresponds to TSC1335008, lane 8 corresponds to 
TSC0128307, and lane 9 corresponds to TSC0259757. 

35 
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FIG. 18B. The original template DNA was amplified using 12 base primers that annealed 
to various regions on chromosome 13. One hundred different primer sets were used to amplify 
regions throughout chromosome 13. For each of the nine SNPs, a primer that annealed 
approximately 130 bases from the locus of interest and 130 bases downstream of the locus of 
interest were used. This amplification reaction, which contained a total of 100 different primer sets, 
was used to amplify the regions containing the loci of interest. The resulting PCR product was used 
in a subsequent PCR reaction, wherein each of the nine SNPs were individually amplified using a 
first primer and a second primer, wherein the second primer contained the binding site for the type 
lis restriction enzyme BsmF I. SNPs were loaded in the same order as FIG. 18 A. 

FIG. 19A. Quantification of the percentage of allele 2 to allele 1 for SNP TSC047003 on 
original template DNA (IA) and multiplexed template DNA (M1-M3), wherein the DNA was first 
amplified using 12 base ppmers that annealed 150 bases upstream and downstream of the loci of 
interest Then, three separate PCR reactions were performed on the multiplexed template DNA, 
using a first and second primer. 

FIG. 19B. Quantification of the percentage of allele 2 to allele 1 for SNP TSC1261039 on 
original template DNA (IA) and multiplexed template DNA (M1-M3), wherein the DNA was first 
amplified using 12 base primers that annealed 150 bases upstream and downstream of the loci of 
interest. Then, three separate PCR reactions were performed on the multiplexed template DNA, 
using a first and second primer. 

FIG. 19C. Quantification of the percentage of allele 2 to allele 1 for SNP TSC3 10507 on 
original template DNA (IA) and multiplexed template DNA (M1-M3), wherein the DNA was first 
amplified using 12 base primers that annealed 150 bases upstream and downstream of the loci of 
interest. Then, three separate PCR reactions were performed on the multiplexed template DNA, 
using a first and second primer. 

FIG. 19D. Quantification of the percentage of allele 2 to allele 1 for SNP TSC1335008 on 
original template DNA (IA) and multiplexed template DNA (M1-M3), wherein the DNA was first 
amplified using 12 base primers that annealed 150 bases upstream and downstream of the loci of 
interest. Then, three separate PCR reactions were performed on the multiplexed template DNA, 
using a first and second primer. 

FIG. 20. Detection of fetal DNA from plasma DNA isolated from a pregnant female. Four 
SNPs wherein the maternal DNA was homozygous were analyzed on the plasma DNA. The 
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maternal DNA was homozygous for adenine at TSC0838335 (lane 1), while the plasma DNA 
displayed a heterozygous pattern (lane 2). The guanine allele represented the fetal DNA, which was 
clearly distinguished from the maternal signal. Both the maternal DNA and the plasma DNA were 
homozygous for adenine at TSC0418134 (lanes 3 and 4). The maternal DNA was homozygous for 
5 guanine at TSC0129188 (lane 5), while the plasma DNA displayed a heterozygous pattern (lane 6). 
The adenine allele represented the fetal DNA. Both the maternal DNA and tfce plasma DNA were 
homozygous for adenine at TSC0501389 (lanes 7 and 8). 

DETAILED DESCRIPTION OF THE INVENTION 

10 The present invention provides a method for detecting genetic disorders, including but not 

limited to mutations, insertions, deletions, and chromosomal abnormalities, and is especially useful 
for the detection of genetic disorders of a fetus. The method is especially useful for detection of a 
translocation, addition, amplification, transversion, inversion, aneuploidy, polyploidy, monosomy, 
trisomy, trisomy 21, trisomy 13, trisomy 14, trisomy 15, trisomy 16, trisomy 18, trisomy 22, 

1 5 triploidy, tetraploidy, and sex chromosome abnormalities including but not limited to XO, XXY, 
XYY, and XXX. The method also provides a non-invasive technique for determining the sequence 
of fetal DNA and identifying mutations within the fetal DNA. 

The invention is directed to a method for detecting chromosomal abnormalities, the method 
comprising: (a) determining the sequence of alleles of a locus of interest on a template DNA; and 

20 (b) quantitating a ratio for the alleles at a heterozygous locus of interest that was identified from the 
locus of interest of (a), wherein said ratio indicates the presence or absence of a chromosomal 
abnormality. 

In another embodiment, the present invention provides a non-invasive method for 
determining the sequence of a locus of interest on fetal DNA, said method comprising: (a) obtaining 

25 a sample from a pregnant female; (b) adding a cell lysis inhibitor, cell membrane stabilizer or cross- 
linker to the sample of (a); (c) obtaining template DNA from the sample of (b), wherein said 
template DNA comprises fetal DNA and maternal DNA; and (d) determining the sequence of a 
locus of interest on template DNA. 

In another emobodiment, the present invention is directed to a method for isolating DNA, 

30 said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a cell lysis 
inhibitor, cell membrane stabilizer or cross-linker to sample of (a); and (c) isolating the DNA. In 
another embodiment, the method is used for isolating free nucleic acid in a sample wherein the free 
nucleic acid is from a species different than the species from which the sample was taken. In 
another embodiment, the method is used for isolating free nucleic acid selected from the group 

35 consisting of bacteria, viruses, fungi, mycobacteria, protozoa, molds, yeasts, plants, humans, non- 
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humans, multi-cellular parasites, animals, and archeabacteria. In another embodiment, the sample 
containing nucleic acid may contain more than one type of nucleic acid including but not limited to 
human and bacterial, human and viral, human and fungal, human, bacterial, and fungal, human, 
bacterial, fungal, and viral, human, bacterial and viral, and human, fungal and viral. 
5 In another embodiment, the present invention provides a method for detecting nucleic acid, 

wherein said method comprises: (a) isolating nucleic acid from a nucleic acid containing sample, 
wherein a cell lysis inhibitor, cell membrane stabilizer, or cross-linker was added to the sample; and 
(b) detecting the presence or absence of the nucleic acid. In a preferred embodiment, the nucleic 
acid isolated is free nucleic acid. In another embodiment, the method is used for isolating free 

1 0 nucleic acid in a sample, wherein the free nucleic acid is from a species different than the species 
from which the sample was taken. In another embodiment, the nucleic acid can be from any 
organism including but not limited to bacteria, viruses, fungi, mycobacteria, protozoa, molds, 
yeasts, plants, humans, non-humans, multi-cellular parasites, animals, and archeabacteria. 

In another embodiment, the present invention provides a method for detecting nucleic acid 

IS containing at least one mutation, wherein said method comprises: (a) isolating nucleic acid from a 
nucleic acid containing sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross- 
linker was added to the sample; and (b) detecting the presence or absence of nucleic acid containing 
the mutation. In a preferred embodiment, the nucleic acid isolated is free nucleic acid. In another 
embodiment, the method is used for isolating free nucleic acid in a sample, wherein the free nucleic 

20 acid is from a species different than the species from which the sample was taken. In another 
embodiment, the nucleic acid can be from any organism including but not limited to bacteria, 
viruses, fungi, mycobacteria, protozoa, molds, yeasts, plants, humans, non-humans, multi-cellular 
parasites, animals, and archeabacteria. In another embodiment, the method is used to detect or 
monitor a disease state including but not limited to diseases listed in Tables V and XV. 

25 The nucleic acid containing sample can contain nucleic acid from multiple species. For 

example, to detect a bacterial infection in a human, a sample may contain the bacterial nucleic acid, 
and human nucleic acid. The nucleic acid can contain both normal and mutant forms of nucleic 
acid from the same species. For instance, for detection of a multi-drug resistant bacterial strain, a 
sample may contain nucleic acid from the multi-drug resistant bacteria, and nucleic acid from the 

3 0 drug-sensitive bacteria and human nucleic acid (see Example 24). 

In a preferred embodiment, the method is used to detect, diagnose or monitor cancer 
including but not limited to carcinoma of the bladder, breast, bronchial, colon, kidney, liver, lung, 
esophagus, gall-bladder, ovary, pancreas, stomach, cervix, thyroid, prostate, and skin; small cell 
lung cancer, squamous cell carcinoma, hematopoietic tumors of lymphoid lineage, leukemia, acute 

35 lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma, T-cell-lymphoma, 
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Hodgkin's lymphoma, non-Hodgkin's lymphoma, hairy cell lymphoma, Burkett's lymphoma, 
hematopoietic tumors of myeloid lineage, acute and chronic myelogenous leukemias, 
myelodysplastic syndrome and promyelocytic leukemia, tumors of mesenchymal origin, 
fibrosarcoma and rhabdomyosarcoma, tumors of the central and peripheral nervous system, 
5 astrocytoma, neuroblastoma, glioma and schwannomas, melanoma, seminoma, teratocarcinoma, 
osteosarcoma, xenoderoma pigmentosum, keratoctanthoma, thyroid follicular cancer and Kaposi's 
sarcoma. 

In another embodiment, the method is useful for monitoring a treatment including but not 
limited to a procedure e.g. surgery, radiation, lifestyle change, dietary protocol, supplementation or 

1 0 administration of a drug or drugs, or combinations of any of the above. When the treatment 
includes the administration of a drug, the drug may be any drug including but not limited to 
chemotherapeutic agents, anti-bacterial agents, anti-viral agents, anti-fungal agents, targeted-cancer 
drugs, cytoxoic agents, cytostatic agents, anti-proliferative agents, Avastin, altretamine, busulfan, 
chlorambucil, cyclophosphamide, Erbitux, Rituxan, ifosfamide, mechlorethamine, melphalan, 

15 thiotepa, cladribine, fluorouracil, floxuridine, gemcitabine, thioguanine, pentostatin, methotrexate, 
6-mercaptopurine, cytarabine, carmustine, lomustine, streptozotocin, carboplatin, cisplatin, 
oxaliplatin, iproplatin, tetraplatin, lobaplatin, JM216, JM335, fludarabine, aminoglutethimide, 
flutamide, goserelin, leuprolide, megestrol acetate, cyproterone acetate, tamoxifen, anastrozole, 
bicalutamide, dexamethasone, diethylstilbestrol, prednisone, bleomycin, dactinomycin, 

20 daunorubicin, doxirubicin, idarubicin, mitoxantrone, losoxantrone, mitomycin-c, plicamycin, 

paclitaxel, docetaxel, CPT-1 1, epothilones , topotecan, irinotecan, 9-amino camptothecan, 9-nitro 

♦ 

camptothecan, GS-21 1, etoposide, teniposide, vinblastine, vincristine, vinorelbine, procarbazine, 
asparaginase, pegaspargase, methoxtrexate, octreotide, estramustine, and hydroxyurea. 

In another embodiment, the present invention provides a method for detecting nucleic acid 

25 containing at least one mutation, wherein said method comprises: (a) isolating nucleic acid from a 
blood sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross-linker was added to 
the sample; and (b) detecting the presence or absence of a nucleic acid molecule containing the 
mutation. In a preferred embodiment, the nucleic acid isolated is free nucleic acid. In another 
embodiment, the method is used for isolating free nucleic acid in a sample, wherein the free nucleic 

30 acid is from a species different than the species from which the sample was taken. In another 
embodiment, the nucleic acid can be from any organism including but not limited to bacteria, 
viruses, fungi, mycobacteria, protozoa, molds, yeasts, plants, humans, non-humans, multi-cellular 
parasites, animals, and archeabacteria. 

In another embodiment, the present invention provides a method for detecting a nucleic 

35 acid molecule containing at least one mutation, wherein said method comprises: (a) isolating 
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nucleic acid from a plasma sample, wherein a cell lysis inhibitor, cell membrane stabilizer, or cross- 
linker was added to a blood sample prior to isolation of the plasma; and (b) detecting the presence 
or absence of a nucleic acid molecule containing the mutation. In a preferred embodiment, the 
nucleic acid isolated is free nucleic acid. In another embodiment, the method is used for isolating 
5 free nucleic acid in a sample, wherein the free nucleic acid is from a species different than the 

species from which the sample was taken. In another embodiment, the nucleic acid can be from any 
organism including but not limited to bacteria, viruses, fungi, mycobacteria, protozoa, molds, 
yeasts, plants, humans, non-humans, multi-cellular parasites, animals, and archeabacteria. In some 
embodiments, isolated nucleic acid is from a drug resistant strain of bacteria, virus or fungus. 

10 In another embodiment, the nucleic acid can be isolated from any appropriate sample 

including but not limited to, nucleic acid-containing samples of tissue, bodily fluid (for example, 
blood, serum, plasma, saliva, urine, tears, peritoneal fluid, ascitic fluid, vaginal secretion, breast 
fluid, breast milk, lymph fluid, cerebrospinal fluid or mucosa secretion), umbilical cord blood, 
chorionic villi, amniotic fluid, an embryo, a two-celled embryo, a four-celled embryo, an 

15 eight-celled embryo, a 16-celled embryo, a 32- celled embryo, a 64-celled embryo, a 128-celled 
embryo, a 256-celled embryo, a 512-celled embiyo, a 1024-celled embryo, embryonic tissues, 
lymph fluid, cerebrospinal fluid, mucosa secretion, or other body exudate, fecal matter, an 
individual cell or extract of the such sources that contain the nucleic acid of the same, and 
subcellular structures such as mitochondria, using protocols well established within the art. In some 

20 embodiments, isolated nucleic acid is from a drug resistant strain of bacteria, virus or fungus. 

In another embodiment, the free nucleic acid is isolated using techniques and/or protocols 
that substantially reduce the lysis of cells present in the sample including but not limited to 
centrifiiging the samples, with the braking power for the centrifuge set to zero (the brake on the 
centrifuge is not used), transferring the supernatant to a new tube with minimal or no disturbance of 

25 the "buffy-coat," and transferring only a portion of the supernatant to a new tube. In a preferred 
embodiment, both acceleration power and braking power for the centrifuge are set to zero. 

In a preferred embodiment, the nucleic acid is deoxyribonucleic acid. In another 
embodiment, the mutation can be any alteration that differs from the normal DNA sequence 
including but not limited to a single point mutation, multiple point mutations, an insertion, a 

30 frameshift, a truncation, a deletion, a duplication, or a transversion. In another embodiment, the 
mutation can be a mutation in any gene including but not limited to BRCA1, BRCA2, MSH6, 
MSH2, MLH1, RET, PTEN, ATM, H-RAS, p53, ELAC2, CDH1, APC, AR, PMS2, MLH3, 
CYP1A1, GSTP1, GSTM1, AX1N2, CYP19, MET, NAT1, CDKN2A, NQ01, trc8, RAD51, PMS1, 
TGFBR2, VHL, MC4R, POMC, NROB2, UCP2, PCSK1, PPARG, ADRB?, UCP3, glurl, cart, 

35 SORBS1, LEP, LEPR, SIM1, TNF, IL-6, IL-1, IL-2, DL-3, ILIA, TAP2, THPO, THRB, NBS1, 
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RBM15, LEF, MPL, RUNX1, Her-2, glucocorticoid receptor, estrogen receptor, thyroid receptor, 
p21, p27, K-RAS, N-RAS, retinoblastoma protein, Wiskott-Aldrich (WAS) gene, Factor V Leiden, 
Factor II (prothrombin), methylene tetrahydrofolate reductase, cystic fibrosis, LDL receptor, HDL 
receptor, superoxide dismutase gene, SHOX gene, genes involved in nitric oxide regulation, genes 
5 involved in cell cycle regulation, tumor suppressor genes, oncogenes, genes associated with 
neurodegeneration, genes associated with obesity. In another embodiment, f the mutation can be 
associated with any disease including but not limited to the diseases listed in Tables V, and Table 
XV. 

In another embodiment, the free nucleic acid to be isolated is bacterial and can be any 
10 bacterial nucleic acid including but not limited to Cocci (round-shaped) bacteria, Bacilli (rod- 
shaped) bacteria, Spixilli (corkscrew-shaped) bacteria, pathogenic bacteria, non-pathogenic bacteria, 
normal flora bacteria, gram positive bacteria, gram-negative bacteria, drug-sensitive bacteria, drug- 
resistant bacteria, Acidaminococcus, Acinetobacter, Acinetobacter lwoffi, Aeromonas, Alcaligenes, 
Bacteroides, Bordetella, Branhamella, Brucella, Calymmatobacterium, Campylobacter, 

1 5 Cardiobacterium, Chromobacterium, Citrobacter, Citrobacter freundii, Coliform group, 

Edwardsiella, Enterobacter, Enterobacter sakazaki, Enterobacter aerogenes, Enterobacter cloacae, 
Enterobacter agglomerans, Enterococcus, Enterococcus faecalis, Enterococcus faecium, 
Escherichia, Escherichia coli, Escherichia coli-0157, Flavobacterium, Francisella, Fusobacterium, 
Haemophilus, Hafhia alvei, Klebsiella, Klebsiella oxytoca, Klebsiella pneumoniae, Legionella, 

20 Moraxella, Morganella, Morganella morganii, Neisseria, Pasturella, Plesiomonas, Proteus, 

Providencia, Proteus mirabilis, Pseudomonas, Pseudomonas aeruginosa, Salmonella, Salmonella 
typhimurium, Serratia, Serratia marcescens, Shigella, Shigella flexneri, Strefptobacillus, Veillonella, 
Vibrio, Vibrio cholera, Yersinia, Yersinnia entercolitica, Xanthomanas maltophiia, Staphylococcus, 
Staphylococcus albus, Staphylococcus aureus, Streptococcus, Streptococcus dysgalacticae, 

25 Micrococcus, Peptococcus, Peptostreptococcus, Bacillus, Bacillus cereus, Clostridium, 

Lactobacillus, Listeria, Listeria monocytogenes, Erysipelothrix, Propionibacterium, Eubacterium, 
and Coiynebacterium species. 

In another embodiment, the nucleic acid to be isolated is fungal, and can be any fungal 
nucleic acid including but not limited to drug-sensitive fungus, drug-resistant fongus, Candida 

30 (including C. albicans, C. tropicalis, C. parapsilosis, C. stellatoidea, C. krusei, C. parakrusei, C. 
lusitanae, C. pseudotropicalis, C guilliermondi and C. glabrata), Aspergillus (including A. 
fumigatus, A. flavus, A. niger, A. nidulans, A. terreus, A. sydowi, A. flavatus, and A. glaucus), 
Cryptococcus, Histoplasma, Coccidioides, Paracoccidioides, Blastomyces, Basidiobolus, 
Conidiobolus, Rhizopus, Rhizomucor, Mucor, Absidia, Mortierella, Cunninghamella, Saksenaea, 

35 Pseudallescheria, Sporotrichosis, Fusarium, Trichophyton, Trichosporon, Microsporum, 
i 
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Epidermophyton, Scytalidium, Malassezia, Actinomycetes, Sporothrix, Penicillium, Saccharomyces 
and Pneumocystis. 

In another embodiment, the nucleic acid to be isolated can be any viral nucleic acid 
including but not limited to DNA virus, KNA virus, retrovirus, pathogenic virus, non-pathogenic 
5 virus, drug-resistant virus, drug-sensitive virus, adeno-associated virus, bird flu virus, cauliflower 
mosaic virus, Chlamydia caused by viral infection, cytomegalovirus (CMV), dengue virus, Epstein- 
Barr virus, feline leukemia virus, flavivirus, haemophilus influenza, hemorrhagic fever viruses, 
hepatitis virus including hepatitis A, B, C, and E, viruses, herpes simplex virus, human herpesvirus 
type A and B, human immunodeficiency virus (HIV), human papilloma virus, human T-cell 

10 lymphotrophic virus, HTLV Type I, HTLV Type n, influenza virus, Japanese encephalitis virus, 
moraxella catarrhalis, non-typeable haemophilus, reovirus, parainfluenza, parvovirus, papova virus, 
Respiratory syncytial virus, Rubella virus, rotavirus, SARS, tomato bushy stunt virus, varicella- 
zoster virus, and vaccinia virus. 

In a preferred embodiment, free nucleic acid is isolated from plasma obtained from blood. 

15 In another embodiment, the free nucleic acid is isolated from plasma, which is generated by 
centrifuging blood, and allowing the centrifuge to stop without applying a brake (the centrifuge 
comes to a stop by natural deceleration). In another embodiment, the blood is centrifiiged at a 
speed including but not limited to 0-50 rpm, 50-100 rpm, 100-200 rpm, 200-300 rpm, 300-400 rpm, 
400-500 rpm, 500-600 rpm, 600-700 rpm, 700-800 rpm, 800-900 rpm, 900-1000 rpm, 1000-2000 

20 rpm, 2000-3000 rpm, 3000-4000 rpm, 4000-5000 rpm, 5000-6000 rpm, 6000-7000 rpm, 7000-8000 
rpm, and greater than 8000 rpm. In a preferred embodiment, the blood is centrifiiged at a speed less 
than 4000 rpm. In another embodiment, the acceleration power of the centrifuge is not used. 

In another embodiment, blood is collected by any method or process that results in a 
substantial increase in the ratio of mutant DNA/normal DNA in the resulting serum or plasma after 

25 appropriate processing. As used herein, a substantial increase in the ratio of mutant DNA/ normal 
DNA is that which can be detected by the methods as described herein. Such methods or processes 
typically result in a substantial increase in the ratio of mutant DNA/normal DNA of about 5%, 10%, 
15%, 20%, 30%, 50%, 70%, 80%, 100%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 
1000% or more of the ratio of mutant DNA/normal DNA found in blood samples collected by 

3 0 standard procedures. 

In other embodiments, blood is collected by any method or process that results in a 
substantial increase in the amount of free DNA compared to the amount of total DNA recovered or 
detected in the resulting serum or plasma after processing. Such methods or processes typically 
result in a substantial increase so the free DNA recovered or detected is about 1%, 5%, 10%, 15%, 
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20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more of the total DNA recovered or 
detected in the processed plasma or serum sample. 

In another embodiment, the sample is collected by any method or process that results in a 
substantial increase in the ratio of mutant DNA to normal DNA after appropriate processing. As 
5 used herein, a substantial increase in the ratio of mutant DNA to normal DNA is that which can be 
detected by the methods as described herein. Such methods or processes typically result in a 
substantial increase in the ratio of mutant DNA to normal DNA of about 5%, 10%, 15%, 20%, 
30%, 50%, 70%, 80%, 100%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 1000% or 
more of the ratio of mutant DNA/normal DNA found in samples collected by standard procedures. 

10 In another embodiment, the methods or processes of collecting blood samples may also 

include other steps that result in lessened or reduced cell lysis. For instance, blood collection 
devices may be modified to decrease cell lysis due to sheer forces in the collection needle, syringe 
or tubes used. For instance, needles of large gauge may be employed to reduce cell sheering or 
vacutainer tubes may be modified to reduce the velocity of blood flow. In another embodiment, the 

15 methods or processes of collecting blood samples may also include other steps that result in 
lessened or reduced destruction of free DNA. 

In another embodiment, the methods or processes of collecting samples may also include 
other steps that result in lessened or reduced cell lysis. For example, urine samples can be collected 
and carefully handled and stored to reduce cell lysis. In another embodiment, the methods or 

20 processes of collecting samples may also include other steps that result in lessened or reduced 
destruction of free DNA. 

In another embodiment, the present invention is directed to a composition comprising free 
mutant DNA and free normal DNA, wherein the relationship of free mutant DNA to free normal 
DNA in the composition includes but is not limited to at least about 5% free mutant DNA, at least 

25 about 10% free mutant DNA, at least about 15% free mutant DNA, at least about 20% free mutant 
DNA, at least about 30% free mutant DNA, at least about 40% free mutant pNA, at least about 
50% free mutant DNA, at least about 60% free mutant DNA, at least about 70% free mutant DNA, 
at least about 80% free mutant DNA, at least about 90% free mutant DNA, at least about 91% free 
mutant DNA, at least about 92% free mutant DNA, at least about 93% free mutant DNA, at least 

30 about 94% free mutant DNA, at least about 95% free mutant DNA, at least about 96% free mutant 
DNA, at least about 97% free mutant DNA, at least about 98% free mutant DNA, at least about 
99% free mutant DNA, and at least about 99.5% free mutant DNA. 

In another embodiment, the present invention is directed to a method of using a 
composition comprising free mutant DNA and free normal DNA for diagnostics, wherein the 

35 relationship of free mutant DNA to free normal DNA in the composition includes but is not limited 
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to at least about 5% free mutant DNA, at least about 10% free mutant DNA, at least about 15% free 

mutant DNA, at least about 20% free mutant DNA, at least about 30% free mutant DNA, at least 

about 40% free mutant DNA, at least about 50% free mutant DNA, at least about 60% free mutant 

DNA, at least about 70% free mutant DNA, at least about 80% free mutant DNA, at least about 

5 90% free mutant DNA, at least about 91% free mutant DNA, at least about 92% free mutant DNA, 

at least about 93% free mutant DNA, at least about 94% free mutant DNA, at least about 95% free 

mutant DNA, at least about 96% free mutant DNA, at least about 97% free mutant DNA, at least 

♦ 

about 98% free mutant DNA, at least about 99% free mutant DNA, and at least about 99.5% free 
mutant DNA. 

10 In another embodiment, the present invention is directed to a composition comprising free 

mutant DNA and free normal DNA, wherein the relationship of free mutant DNA to free normal 
DNA in the composition includes but not is limited to about 5-10% free mutant DNA, about 10- . 
13% free mutant DNA, about 13-15% free mutant DNA, about 15-16% free mutant DNA, about 
16-17% free mutant DNA, about 17-18% free mutant DNA, about 18-19% free mutant DNA, about 

15 19-20% free mutant DNA, about 20-21% free mutant DNA, about 2 1-22% free mutant DNA, about 
22-23% free mutant DNA, about 23-24% free mutant DNA, about 24-25% free mutant DNA, about 
25-35% free mutant DNA, about 35-45% free mutant DNA, about 45-55% free mutant DNA, about 
55-65% free mutant DNA, about 65-75% free mutant DNA, about 75-85% free mutant DNA, about 
85-90% free mutant DNA, about 90-91% free mutant DNA, about 91-92% free mutant DNA, about 

20 92-93% free mutant DNA, about 93-94% free mutant DNA, about 94-95% free mutant DNA, about 
95-96% free mutant DNA, about 96-97% free mutant DNA, about 97-98% free mutant DNA, about 
98-99% free mutant DNA, and about 99-99.7% free mutant DNA. 

In another embodiment, the present invention is directed to a method of using a 
composition comprising free mutant DNA and free normal DNA for diagnostics, wherein the 

25 relationship of free mutant DNA to free normal DNA in the composition includes but is not limited 
to about 5-10% free mutant DNA, about 10-13% free mutant DNA, about 13-15% free mutant 
DNA, about 15-16% free mutant DNA, about 16-17% free mutant DNA, about 17-18% free mutant 
DNA, about 18-19% free mutant DNA, about 19-20% free mutant DNA, about 20-21% free mutant 
DNA, about 21-22% free mutant DNA, about 22-23% free mutant DNA, about 23-24% free mutant 

30 DNA, about 24-25% free mutant DNA, about 25-35% free mutant DNA, about 35-45% free mutant 
DNA, about 45-55% free mutant DNA, about 55-65% free mutant DNA, about 65-75% free mutant 
DNA, about 75-85% free mutant DNA, about 85-90% free mutant DNA, about 90-91% free mutant 
DNA, about 91-92% free mutant DNA, about 92-93% free mutant DNA, about 93-94% free mutant 
DNA, about 94-95% free mutant DNA, about 95-96% free mutant DNA, about 96-97% free mutant 
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DNA, about 97-98% free mutant DNA, about 98-99% free mutant DNA, or about 99-99.7% free 
mutant DNA. 

In another embodiment, the present invention is directed to a composition comprising free 

i 

mutant DNA and free normal DNA, wherein the relationship of free mutant DNA to free normal 
5 DNA in the composition includes but is not limited to a maximum of about 5-10% free mutant 
DNA, a maximum of about 10-13% free mutant DNA, a maximum of about 13%- 15% free mutant 
DNA, a maximum of about 15-18% free mutant DNA, a maximum of about 18-20% free mutant 
DNA, a maximum of about 20-40% free mutant DNA, a maximum of about 40-50% free mutant 
DNA, a maximum of about 50-60% free mutant DNA, a maximum of about 60-70% free mutant 

10 DNA, a maximum of about 70-80% free mutant DNA, a maximum of about 80-90% free mutant 
DNA, a maximum of about 90-92% free mutant DNA, a maximum of about 92-94% free mutant 
DNA, a maximum of about 94-95% free mutant DNA, a maximum of about 95-96% free mutant 
DNA, a maximum of about 96-97% free mutant DNA, a maximum of about 97-98% free mutant 
DNA, a maximum of about 98-99% free mutant DNA, a maximum of about 99-99.5% free mutant 

1 5 DNA, and a maximum of about 99.5-99.9% free mutant DNA. 

In another embodiment, the present invention is directed to a method of using a 
composition comprising free mutant DNA and free normal DNA for diagnostics, wherein the 
relationship of free mutant DNA to free normal DNA in the composition includes but is not limited 
a maximum of about 5-10% free mutant DNA, a maximum of about 10-13% free mutant DNA, a 

20 maximum of about 13%-15% free mutant DNA, a maximum of about 15-18,% free mutant DNA, a 
maximum of about 18-20% free mutant DNA, a maximum of about 20-40% free mutant DNA, a 
maximum of about 40-50% free mutant DNA, a maximum of about 50-60% free mutant DNA, a 
maximum of about 60-70% free mutant DNA, a maximum of about 70-80% free mutant DNA, a 
maximum of about 80-90% free mutant DNA, a maximum of about 90-92% free mutant DNA, a 

25 maximum of about 92-94% free mutant DNA, a maximum of about 94-95% free mutant DNA, a 
maximum of about 95-96% free mutant DNA, a maximum of about 96-97% free mutant DNA, a 
maximum of about 97-98% free mutant DNA, a maximum of about 98-99% free mutant DNA, a 
maximum of about 99-99.5% free mutant DNA, and a maximum of about 99.5-99.9% free mutant 
DNA. 

30 In another embodiment, the present invention is directed to a method for isolating free 

DNA, said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a cell lysis 
inhibitor, cell membrane stabilizer or cross-linker to sample of (a); and (c) isolating the DNA. 

In another embodiment, the present invention is directed to a method for isolating free 
DNA from a sample containing nucleic acid to which a cell lysis inhibitor, cell membrane stabilizer 

35 or cross-linker has been added, said method comprising isolating the DNA. 

» 
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In another emobodiment, the present invention is directed to a method for isolating free 
fetal DNA, said method comprising (a) obtaining a sample containing nucleic acid; (b) adding a cell 
lysis inhibitor, cell membrane stabilizer or cross-linker to sample of (a); and (c) isolating the DNA. 
In another embodiment, the DNA is isolated using any technique suitable in the art including but 
5 not limited to cesium chloride gradients, gradients, sucrose gradients, glucose gradients, 

centrifugation protocols, boiling, Qiagen purification systems, QIA DNA blood purification kit, 
HiSpeed Plasmid Maxi Kit, QIAfilter plasmid kit, Promega DNA purification systems, MangeSil 
Paramagnetic Particle based systems, Wizard SV technology, Wizard Genomic DNA purification 
kit, Amersham purification systems, GFX Genomic Blood DNA purification kit, Invitrogen Life 

1 0 Technologies Purification Systems, CONCERT purification system, Mo Bio Laboratories 
purification systems, UltraClean BloodSpin Kits, and UlraClean Blood DNA Kit. 

In another emobodiment, the present invention is directed to a method for isolating free 
fetal DNA from a sample containing nucleic acid to which a cell lysis inhibitor, cell membrane 
stabilizer or cross-linker has been added, said method comprising isolating the DNA. In a preferred 

1 5 embodiment, the free fetal DNA is isolated from plasma or serum obtained from the blood of a 
pregnant female. 

In another embodiment, the DNA is isolated using techniques and/or protocols that 
substantially reduce the amount of maternal DNA in the sample including but not limited to 
centrifuging the samples, with the braking power for the centrifuge set to zero (the brake on the 
20 centrifuge is not used), transferring the supernatant to a new tube with minimal or no disturbance of 
the "bufly-coat," and transferring only a portion of the supernatant to a new tube. In a preferred 
embodiment, both acceleration power and braking power for the centrifuge are set to zero. 

In another embodiment, the DNA is isolated using techniques and/Qr protocols that 
substantially reduce the amount of maternal DNA in the sample including but not limited to 
25 centrifuging the samples, with the acceleration power for the centrifuge set to zero, transferring the 
supernatant to a new tube with minimal or no disturbance of the "buffy-coat," and transferring only 
a portion of the supernatant to a new tube. 

In another embodiment, the "buffy-coat" is removed from the tube prior to removal of the 
supernatant using any applicable method including but not limited to using a syringe or needle to 
30 withdraw the "buffy-coat" 

In another embodiment, the braking power for the centrifuge is set at a percentage including 
but not limited to 1-5%, 5-10%, 10-20%, 20-30%, 30-40%, 40-50%, 50-60%, 60-70%, 70-80%, 80- 
90%, 90-95%, 95-99% of maximum braking power. 
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In another embodiment, the acceleration power for the centrifuge is set at a percentage 
including but not limited to 1-5%, 5-10%, 10-20%, 20-30%, 30-40%, 40-50%, 50-60%, 60-70%, 
70-80%, 80-90%, 90-95%, 95-99% of maximum acceleration power. 

In another embodiment, the present invention is directed to a composition comprising free 
5 fetal DNA and free maternal DNA, wherein the composition comprises a relationship of free fetal 
DNA to free maternal DNA including but not limited to at least about 15% free fetal DNA, at least 
about 20% free fetal DNA, at least about 30% free fetal DNA, at least about 40% free fetal DNA, at 
least about 50% free fetal DNA, at least about 60% free fetal DNA, at least about 70% free fetal 
DNA, at least about 80% free fetal DNA, at least about 90% free fetal DNA, at least about 91% free 

10 fetal DNA, at least about 92% free fetal DNA, at least about 93% free fetal DNA, at least about 
94% free fetal DNA, at least about 95% free fetal DNA, at least about 96% free fetal DNA, at least 
about 97% free fetal DNA, at least about 98% free fetal DNA, at least about 99% free fetal DNA, 
and at least about 99.5% free fetal DNA. 

In another embodiment, the present invention is directed to a method of using a 

15 composition comprising free fetal DNA and free maternal DNA for prenatal diagnostics, wherein 
the composition comprises a relationship of free fetal DNA to free maternal DNA including but not 
limited to at least about 15% free fetal DNA, at least about 20% free fetal DNA, at least about 30% 
free fetal DNA, at least about 40% free fetal DNA, at least about 50% free fetal DNA, at least about 
60% free fetal DNA, at least about 70% free fetal DNA, at least about 80% free fetal DNA, at least 

20 about 90% free fetal DNA, at least about 91% free fetal DNA, at least about 92% free fetal DNA, at 
least about 93% free fetal DNA, at least about 94% free fetal DNA, at least about 95% free fetal 
DMA, at least about 96% free fetal DNA, at least about 97% free fetal DNA, at least about 98% free 
fetal DNA, at least about 99% free fetal DNA, and at least about 99.5% free fetal DNA. 

In another embodiment, the present invention is directed to a composition comprising free 

25 fetal DNA and free maternal DNA , wherein the composition comprises a relationship of free fetal 
DNA to free maternal DNA including but not limited to about 13-15% free fetal DNA, about 15- 
16% free fetal DNA, about 16-17% free fetal DNA, about 17-18% free fetal DNA, about 18-19% 
free fetal DNA, about 19-20% free fetal DNA, about 20-21% free fetal DNA, about 21-22% free 
fetal DNA, about 22-23% free fetal DNA, about 23-24% free fetal DNA, about 24-25% free fetal 

30 DNA, about 25-35% free fetal DNA, about 35-45% free fetal DNA, about 45-55% free fetal DNA, 
about 55-65% free fetal DNA, about 65-75% free fetal DNA, about 75-85% free fetal DNA, about 
85-90% free fetal DNA, about 90-91% free fetal DNA, about 91-92% free fetal DNA, about 92- 
93% free fetal DNA, about 93-94% free fetal DNA, about 94-95% free fetal DNA, about 95-96% 
free fetal DNA, about 96-97% free fetal DNA, about 97-98% free fetal DNA, about 98-99% free 

35 fetal DNA, and about 99-99.7% free fetal DNA. 
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In another embodiment, the present invention is directed to a method of using a 
composition comprising free fetal DNA and free maternal DNA for prenatal diagnostics, wherein 
the composition comprises a relationship of free fetal DNA to free maternal DNA including but not 
limited to about 13-15% free fetal DNA, about 15-16% free fetal DNA, about 16-17% free fetal 
5 DNA, about 17-18% free fetal DNA, about 18-19% free fetal DNA, about 19-20% free fetal DNA, 
about 20-21% free fetal DNA, about 21-22% free fetal DNA, about 22-23% free fetal DNA, about 
23-24% free fetal DNA, about 24-25% free fetal DNA, about 25-35% free fetal DNA, about 35- 
45% free fetal DNA, about 45-55% free fetal DNA, about 55-65% free fetal DNA, about 65-75% 
free fetal DNA, about 75-85% free fetal DNA, about 85-90% free fetal DNA, about 90-91% free 

10 fetal DNA, about 91-92% free fetal DNA, about 92-93% free fetal DNA, about 93-94% free fetal 
DNA, about 94-95% free fetal DNA, about 95-96% free fetal DNA, about 96-97% free fetal DNA, 
about 97-98% free fetal DNA, about 98-99% free fetal DNA, or about 99-99.7% free fetal DNA. 

In another embodiment, the present invention is directed to a composition comprising free 
fetal DNA and free maternal DNA, wherein the composition comprises a relationship of free fetal 

15 DNA to free maternal DNA including but not limited a maximum of 13%-15% free fetal DNA, a 
maximum of 15-18% free fetal DNA, a maximum of 18-20% free fetal DNA, a maximum of 20- 
40% free fetal DNA, a maximum of 40-50% free fetal DNA, a maximum of 50-60% free fetal 
DNA, a maximum of 60-70% free fetal DNA, a maximum of 70-80% free fetal DNA, a maximum 
of 80-90% free fetal DNA, a maxium of 90-92% free fetal DNA, a maxium of 92-94% free fetal 

20 DNA, a maximum of 94-95% free fetal DNA, a maxium of 95-96% free fetal DNA, a maxium of 
96-97% free fetal DNA, a maxium of 97-98% free fetal DNA, a maximum of 98-99% free fetal 
DNA, a maxium of 99-99.5% free fetal DNA, and a maxium of 99.5-99.9% free fetal DNA. 

In another embodiment, the present invention is directed to a method of using a 
composition comprising free fetal DNA and free maternal DNA for prenatal diagnostics, wherein 

25 the composition comprises a relationship of free fetal DNA to free maternal DNA including but not 
limited a maximum of 13%-15% free fetal DNA, a maximum of 15-18% free fetal DNA, a 
maximum of 1 8-20% free fetal DNA, a maximum of 20-40% free fetal DNA, a maximum of 40- 
50% free fetal DNA, a maximum of 50-60% free fetal DNA, a maximum of 60-70% free fetal 
DNA, a maximum of 70-80% free fetal DNA, a maximum of 80-90% free fetal DNA, a maxium of 

30 90-92% free fetal DNA, a maxium of 92-94% free fetal DNA, a maximum of 94-95% free fetal 

DNA, a maxium of 95-96% free fetal DNA, a maxium of 96-97% fee fetal DNA, a maxium of 97- 
98% free fetal DNA, a maximum of 98-99% free fetal DNA, a maxium of 99-99.5% free fetal 
DNA, and a maxium of 99.5-99.9% free fetal DNA. 
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DNA Template 

By a "locus of interest" is intended a selected region of nucleic acid that is within a larger 
region of nucleic acid. A locus of interest can include but is not limited to 1-100, 1-50, 1-20, or 
1-10 nucleotides, preferably 1-6, 1-5, 1-4, 1-3, 1-2, or 1 nucleotide(s). 
5 As used herein, an "allele" is one of several alternate forms of a gehe or non-coding regions 

of DNA that occupy the same position on a chromosome. The term allele can be used to describe 
DNA from any organism including but not limited to bacteria, viruses, fungi, protozoa, molds, 
yeasts, plants, humans, non-humans, multi-cellular parasite, animals, and archeabacteria. 

For example, bacteria typically have one large strand of DNA. The term allele with respect 
10 to bacterial DNA refers to the form of a gene found in one cell as compared to the form of the same 
gene in a different bacterial cell of the same species. 

Alleles can have the identical sequence or can vary by a single nucleotide or more than one 
nucleotide. With regard to organisms that have two copies of each chromosome, if both 
chromosomes have the same allele, the condition is referred to as homozygous. If the alleles at the 
15 two chromosomes are different, the condition is referred to as heterozygous. For example, if the 
locus of interest is SNP X on chromosome 1, and the maternal chromosome contains an adenine at 
SNP X (A allele) and the paternal chromosome contains a guanine at SNP X (G allele), the 
individual is heterozygous at SNP X. 

As used herein, sequence means the identity of one nucleotide or more than one contiguous 

20 nucleotides in a polynucleotide. In the case of a single nucleotide, e.g., a SNP, "sequence" and 

* 

"identity" are used interchangeably herein. 

The term "chromosomal abnormality" refers to a deviation between the structure of the 
subject chromosome and a normal homologous chromosome. The term "normal" refers to the 
predominate karyotype or banding pattern found in healthy individuals of a particular species. A 

25 chromosomal abnormality can be numerical or structural, and includes but is not limited to 

aneuploidy, polyploidy, inversion, a trisomy, a monosomy, duplication, deletion, deletion of a part 
of a chromosome, addition, addition of a part of chromosome, insertion, a fragment of a 
chromosome, a region of a chromosome, chromosomal rearrangement, and translocation. A 
chromosomal abnormality can be correlated with presence of a pathological condition or with a 

30 predisposition to develop a pathological condition. As defined herein, a single nucleotide 
polymorphism ("SNP") is not a chromosomal abnormality. 

As used herein, incorporation of a nucleotide by a polymerase is referred to as an 
elongation reaction or a fill-in reaction interchangeably. 

As used herein with respect to individuals, "mutant alleles" refers to variant alleles that are 

35 associated with a disease state. 
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The term "template" refers to any nucleic acid molecule that can be used for amplification 
in the invention. RNA or DNA that is not naturally double stranded can be made into double 
stranded DNA so as to be used as template DNA. Any double stranded DNA or preparation 
containing multiple, different double stranded DNA molecules can be used as template DNA to 
amplify a locus or loci of interest contained in the template DNA. 

The template DNA can be obtained from any source including but not limited to humans, 
non-humans, mammals, reptiles, cattle, cats, dogs, goats, swine, pigs, monkeys, apes, gorillas, bulls, 
cows, bears, horses, sheep, poultry, mice, rats, fish, dolphins, whales, and sharks. 

The template DNA can be from any appropriate sample including but not limited to, nucleic 
acid-containing samples of tissue, bodily fluid (for example, blood, serum, plasma, saliva, urine, 
tears, peritoneal fluid, ascitic fluid, vaginal secretion, breast fluid, breast milk, lymph fluid, 
cerebrospinal fluid or mucosa secretion), umbilical cord blood, chorionic villi, amniotic fluid, an 
embryo, a two-celled embiyo, a four-celled embryo, an eight-celled embryo, a 16-celled embryo, a 
32- celled embryo, a 64-celled embryo, a 128-celled embryo, a 256-celled embryo, a 512-ceiled 
embryo, a 1024-celled embryo, embryonic tissues, lymph fluid, cerebrospinal fluid, mucosa 
secretion, or other body exudate, fecal matter, an individual cell or extract of the such sources that 
contain the nucleic acid of the same, and subcellular structures such as mitochondria, using 
protocols well established within the art. 

In one embodiment, the template DNA can be obtained from a sample of a pregnant female. 
In another embodiment, the template DNA can be obtained from an embryo. In a preferred 
embodiment, the template DNA can be obtained from a single-cell of an embiyo. 

In one embodiment, the template DNA is fetal DNA. Fetal DNA can be obtained from 
sources including but not limited to maternal blood, maternal serum, maternal plasma, fetal cells, 
umbilical cord blood, chorionic villi, amniotic fluid, urine, saliva, cells or tissues. 

In another embodiment, a cell lysis inhibitor is added to the sample including but not 
limited to formaldehyde, formaldehyde derivatives, formalin, glutaraldehyde, glutaraldehyde 
derivatives, primary amine reactive crosslinkers, sulfhydryl reactive crosslinkers, sulfhydryl 
addition or disulfide reduction, carbohydrate reactive crosslinkers, carboxyl reactive crosslinkers, 
photoreactive crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , 
BMB, BMDB, BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, 
DSS, DST, DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfb-DST, Sulfo-EGS or 
compounds listed in Table XXm. In another embodiment, two, three, four, five or more than five 
cell lysis inhibitors can be added to the sample. In a preferred embodiment, formalin is present in 
the sample at a percentage including but not limited to 0.0001-0.03%, 0.03-0.05%, 0.05-0.08%, 
0.08-0.1%, 0.1-0.3%, 0.3-0.5%, 0.5-0.7%, 0.7-0.9%, 0.9-1.2%, 1.2-1.5%, 1.5-2%, 2-3%, 3-5%, and 
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greater than 5%. In another embodiment, any combination of cross-linker, cell membrane 
stabilizer, or cell lysis inhibitor can be added to the sample including but not limited to a cross- 
linker and a cell membrane stabilizer, a cross-linker and a cell lysis inhibitor, and a cell membrane 
stabilizer and a cell lysis inhibitor. More than one cross-linker can be used with more than one cell 

5 membrane stabilizer. More than one cross-linker can be used with more than one cell lysis 

inhibitor. More than one cell membrane stabilizer can be used with more than cell lysis inhibitor. 

In another embodiment, the cell lysis inhibitor is added to the sample such that lysis is less 
than about 10% of the cells. In a preferred embodiment, the cell lysis inhibitor is added to the 
sample such that lysis is less than about 5% of the cells. In a most preferred embodiment, the cell 

10 lysis inhibitor is added to the sample such that lysis is less than about 1% of the cells. 

In another embodiment, a cell membrane stabilizer is added to the sample such that lysis is 
less than about 10% of the cells. In a preferred embodiment, the cell membrane stabilizer is added 
to the sample such that lysis is less than about 5% of the cells. In a most preferred embodiment, the 
cell membrane stabilizer is added to the sample such that lysis is less than about 1% of the cells. 

1 5 in another embodiment, a cross-linker is added to the sample such that lysis is less than 

about 10% of the cells. In a preferred embodiment, the cross-linker is added to the sample such that 
lysis is less than about 5% of the cells. In a most preferred embodiment, the cross-linker is added to 
the sample such that lysis is less than about 1% of the cells. 

In another embodiment, the cell lysis inhibitor, cross-linker or cell membrane stabilizer is 

20 added to the sample in an applicable time period including but not limited to 1-10 seconds, 10-30 
seconds, 30-60 seconds,l-5 minutes, 5-10 minutes, 10-20 minutes, 20-30 minutes, 30-40 minutes, 
40-50 minutes, 60-90 minutes, 90-180 minutes or greater than 180 minutes after collection of the 
sample. In another embodiment, the cell lysis inhibitor, cross-linker, or cell membrane stabilizer is 
present in the apparatus to which the sample is collected including but not limited to a glass tube, a 

25 plastic tube, a circular container, an eppendorf tube, an IV bag, or any other appropriate collection 
device. In another embodiment, after the addition of the cell lysis inhibitor, cell membrane 
stabilizer, or cross-linker, the sample is left at about room temperature for the period of time to 
allow the reagent to function, including but not limited to 1-5, 5-10, 10-20, 2040, 40-60, 60-90, 90- 
120, 120-150, 150-180, 180-240, 240-300 or greater than 300 minutes. 

30 In another embodiment, the template DNA contains both maternal DNA and fetal DNA. In 

a preferred embodiment, template DNA is obtained from blood of a pregnant female. Blood is 
collected using any standard technique for blood-drawing including but not limited to venipuncture. 
For example^ blood can be drawn from a vein from the inside of the elbow or the back of the hand. 
Blood samples can be collected from a pregnant female at any time during fetal gestation. For 

35 example, blood samples can be collected from human females at 1-4, 4-8, 8-12, 12-16, 16-20, 
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20-24, 24-28, 28-32, 32-36, 36-40, or 40-44 weeks of fetal gestation, and preferably between 8-28 
weeks of fetal gestation. 

The blood sample is centrifuged to separate the plasma from the maternal cells. The 
plasma and maternal cell fractions are transferred to separate tubes and re-ceptrifuged. The plasma 
5 fraction contains cell-free fetal DNA and maternal DNA. Any standard DNA isolation technique 
can be used to isolate the fetal DNA and the maternal DNA including but not limited to QIAamp 
DNA Blood Midi Kit supplied by QIAGEN (Catalog number 5 1 1 83). 

In a preferred embodiment, blood can be collected into an apparatus containing a 
magnesium chelator including but not limited to EDTA, and is stored at 4°C. Optionally, a calcium 
1 0 chelator, including but not limited to EGTA, can be added. 

In another embodiment, a cell lysis inhibitor is added to the maternal blood including but 
not limited to formaldehyde, formaldehyde derivatives, formalin, glutaraldehyde, glutaraldehyde 
derivatives, a protein cross-linker, a nucleic acid cross-linker, a protein and nucleic acid cross- 
linker, primary amine reactive crosslinkers, sulfhydryl reactive crosslinkers, sulfydryl addition or 
1 5 disulfide reduction, carbohydrate reactive crosslinkers, carboxyl reactive crosslinkers, photoreactive 
crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, 
BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, 
DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS, or compounds 
listed in Table XXm. 

20 In another embodiment, an agent that stabilizes cell membranes may be added to the 

maternal blood samples to reduce maternal cell lysis including but not limited to aldehydes, urea 
formaldehyde, phenol formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol 
derivatives, high concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, 
calcium gluconate, taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, 

25 glucose, amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 

30 hyaluronate, or polaxamer 188. 

In another embodiment, the template DNA is obtained from the plasma or serum of the 
blood of the pregnant female. The percentage of fetal DNA in maternal plasma is between 
0.39-11.9% (Pertl, andBianchi, Obstetrics and Gynecology 98: 483-490 (2001)). The majority of 
the DNA in the plasma sample is maternal, which makes using the DNA for genotyping the fetus 

35 difficult. However, methods that increase the percentage of fetal DNA in the maternal plasma 
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allow the sequence of the fetal DNA to be determined, and allow for the detection of genetic 
disorders including mutations, insertions, deletions, and chromosomal abnormalities. The addition 
of cell lysis inhibitors, cell membrane stabilizers or cross-linkers to the maternal blood sample can 
increase the relative percentage of fetal DNA. While lysis of both maternal and fetal cells is 
5 inhibited, the vast majority of cells are maternal, and thus by reducing the lysis of maternal cells, 
there is a relative increase in the percentage of free fetal DNA. See Example 4. 

In another embodiment, any blood drawing technique, method, protocol, or equipment that 
reduce the amount of cell lysis can be used, including but not limited to a large boar needle, a 
shorter length needle, a needle coating that increases laminar flow, e.g., teflon, a modification of the 

1 0 bevel of the needle to increase laminar flow, or techniques that reduce the rate of blood flow. The 
fetal cells likely are destroyed in the maternal blood by the mother's immune system. However, it 
is likely that a large portion of the maternal cell lysis occurs as a result of the blood draw or 
processing of the blood sample. Thus, methods that prevent or reduce cell lysis will reduce the 
amount of maternal DNA in the sample, and increase the relative percentage of free fetal DNA. 

15 In another embodiment, an agent that preserves or stabilizes the structural integrity of cells 

can be used to reduce the amount of cell lysis. 

In another embodiment, any protocol that reduces the amount of free maternal DNA in the 
maternal blood can be used prior to obtaining the sample. In another embodiment, prior to 
obtaining the sample, the pregnant female rests without physical activity for a period of time 

20 including but not limited to 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, 40-45, 45-50, 50- 
55, 55-60, 60-120, 120-180, 180-240, 240-300, 300-360, 360-420, 420-480, 480-540, 540-600, 
600-660, 660-720, 720-780, 780-840, 840-900, 900-1200, 1200-1500, 1500-1800, 1800-2100, 
2100-2400, 2400-2700, 2700-3000, 3000-3300, 3300-3600, 3600-3900, 3900^200, 4200-4500, and 
greater than 4500 minutes. In another embodiment, the sample is obtained from the pregnant 

25 female after her body has reached a relaxed state. The period of rest prior to obtaining the sample 
may reduce the amount of maternal nucleic acid in the sample. In another embodiment, the sample 
is obtained from the pregnant female in the a.m., including but not limited to 4-5 am, 5-6 am, 6-7 
am, 7-8 am, 8-9 am, 9-10 am, 10-1 1 am, and 1 1-12 am. 

In another embodiment, the sample is obtained from the pregnant female after she has slept 

30 for a period of time including but not limited to 0-1, 1-2, 2-3, 3-4, 4-5, 5-6, 6-7, 7-8, 8-9, 9-10, 10- 
11,11-12, or greater than 12 hours. 

In another embodiment, prior to obtaining the sample, the pregnant female exercises for a 
period of time followed by a period of rest In another embodiment, the period of exercise includes 
but is not limited to 0-15, 15-30, 30-45, 45-60, 60-120, 120-240, or greater than 240 minutes. 
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In another embodiment, agents that prevent the destruction of DNA, including but not 
limited to a DNase inhibitor, zinc chloride, ethylenediaminetetraacetic acid, guanidine-HCl, 
guanidine isothiocyanate, N-lauroylsarcosine, and Na-dodecylsulphate, can be added to the blood 
sample. 

5 In another embodiment, fetal DNA is obtained from a fetal cell, wherein said fetal cell can 

be isolated from sources including but not limited to maternal blood, umbilical cord blood, 
chorionic villi, amniotic fluid, embryonic tissues and mucous obtained from the cervix or vagina of 
the mother. 

In a preferred embodiment, fetal cells are isolated from maternal peripheral blood. An 
1 0 antibody specific for fetal cells can be used to purify the fetal cells from the maternal serum 

(Mueller et al., Lancet 336: 197-200 (1990); Ganshirt-Ahlert et al, Am. J. Obstet. Gynecol. 166: 
1350-1355 (1992)). Flow cytometry techniques can also be used to enrich fetal cells (Herzenberg et 
al., PNAS 76: 1453-1455 (1979); Bianchi et al., PNAS87: 3279-3283 (1990); Bruch etal, Prenatal 
Diagnosis 11: 787-798 (1991)). U.S. Pat. No. 5,432,054 also describes a technique for separation 
15 of fetal nucleated red blood cells, using a tube having a wide top and a narrow, capillary bottom 
made of polyethylene. Centrifugation using a variable speed program results in a stacking of red 
blood cells in the capillary based on the density of the molecules. The density fraction containing 
low density red blood cells, including fetal red blood cells, is recovered and then differentially 
hemolyzed to preferentially destroy maternal red blood cells. A density gradient in a hypertonic 
20 medium is used to separate red blood cells, now enriched in the fetal red blood cells from 

lymphocytes and ruptured maternal cells. The use of a hypertonic solution shrinks the red blood 
cells, which increases their density, and facilitate purification from the more dense lymphocytes. 
After the fetal cells have been isolated, fetal DNA can be purified using standard techniques in the 
art. 

25 The nucleic acid that is to be analyzed can be any nucleic acid, e.g., genomic, plasmid, 

cosmid, yeast artificial chromosomes, artificial or man-made DNA, including unique DNA 
sequences, and also DNA that has been reverse transcribed from an RNA sample, such as cDNA. 
The sequence of RNA can be determined according to the invention if it is capable of being made 
into a double stranded DNA form to be used as template DNA. 

30 The terms "primer" and "oligonucleotide primer" are interchangeable when used to discuss 

an oligonucleotide that anneals to a template and can be used to prime the synthesis of a copy of 
that template. 

"Amplified" DNA is DNA that has been "copied" once or multiple times, e.g. by 
polymerase chain reaction. When a large amount of DNA is available to assay, such that a 
35 sufficient number of copies of the locus of interest are already present in the sample to be assayed, 
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it may not be necessary to "amplify" the DNA of the locus of interest into an even larger number of 
replicate copies. Rather, simply "copying" the template DNA once using a set of appropriate 
primers, which may contain hairpin structures that allow the restriction enzyme recognition sites to 
be double stranded, can suffice. 
5 "Copy" as in "copied DNA" refers to DNA that has been copied once, or DNA that has 

been amplified into more than one copy. 

In one embodiment, the nucleic acid is amplified directly in the original sample containing 
the source of nucleic acid. It is not essential that the nucleic acid be extracted, purified or isolated; 
it only needs to be provided in a form that is capable of being amplified. Hybridization of the 

1 0 nucleic acid template with primer, prior to amplification, is not required. For example, 

amplification can be performed in a cell or sample lysate using standard protocols well known in 
the art. DNA that is on a solid support, in a fixed biological preparation, or otherwise in a 
composition that contains non-DNA substances and that can be amplified without first being 
extracted from the solid support or fixed preparation or non-DNA substances in the composition 

15 can be used directly, without further purification, as long as the DNA can anneal with appropriate 
primers, and be copied, especially amplified, and the copied or amplified products can be recovered 
and utilized as described herein. 

In a preferred embodiment, the nucleic acid is extracted, purified or isolated from 
non-nucleic acid materials that are in the original sample using methods known in the art prior to 

20 amplification. 

In another embodiment, the nucleic acid is extracted, purified or isolated from the original 
sample containing the source of nucleic acid and prior to amplification, the nucleic acid is 
fragmented using any number of methods well known in the art including but not limited to 
enzymatic digestion, manual shearing, or sonication. For example, the DNA can be digested with 

25 one or more restriction enzymes that have a recognition site, and especially an eight base or six base 
pair recognition site, which is not present in the loci of interest. Typically, DNA can be fragmented 
to any desired length, including 50, 100, 250, 500, 1,000, 5,000, 10,000, 50,000 and 100,000 base 
pairs long. In another embodiment, the DNA is fragmented to an average length of about 1000 to 
2000 base pairs. However, it is not necessary that the DNA be fragmented. 

30 Fragments of DNA that contain the loci of interest can be purified from the fragmented 

DNA before amplification. Such fragments can be purified by using primers that will be used in the 
amplification (see "Primer Design" section below) as hooks to retrieve the loci of interest, based on 
the ability of such primers to anneal to the loci of interest. In a preferred embodiment, tag-modified 
primers are used, such as e.g. biotinylated primers. 
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By purifying the DNA fragments containing the loci of interest, the specificity of the 
amplification reaction can be improved. This will minimize amplification of nonspecific regions of 
the template DNA. Purification of the DNA fragments can also allow multiplex PCR (Polymerase 
Chain Reaction) or amplification of multiple loci of interest with improved specificity. 
5 The loci of interest that are to be sequenced can be selected based upon sequence alone. In 

humans, over 1.42 million single nucleotide polymorphisms (SNPs) have been described {Nature 
40P:928-933 (2001); The SNP Consortium LTD). On the average, there is one SNP every 1.9 kb of 
human genome. However, the distance between loci of interest need not be considered when 
selecting the loci of interest to be sequenced according to the invention. If more than one locus of 

1 0 interest on genomic DNA is being analyzed, the selected loci of interest can be on the same 
chromosome or on different chromosomes. 

In a preferred embodiment, the selected loci of interest can be clustered to a particular 
region on a chromosome. Multiple loci of interest can be located within a region of DNA such that 
even with any breakage or fragmentation of the DNA, the multiple loci of interest remain linked. 

15 For example, if the DNA is obtained and by natural forces is broken into fragments of 5 Kb, 
multiple loci of interest can be selected within the 5 Kb regions. This allows each fragment, as 
measured by the loci of interest within that fragment, to serve as an experimental unit, and will 
reduce any possible experimental noise of comparing loci of interest on multiple chromosomes. 

The loci of interest on a chromosome can be any distance from each other including but not 

20 limited to 10-50, 50-100, 100-150, 150-200, 200-250, 250-500, 500-750, 750-1000, 1000-1500, 
1500-2000, 2000-2500, 2500-3000, 3000-3500, 3500-4000, 4000-4500, 4500-5000, 5000-10,000 
and greater than 1 0,000 base pairs. 

In a preferred embodiment, the length of sequence that is amplified is preferably different 
for each locus of interest so that the loci of interest can be separated by size. 

25 In fact, it is an advantage of the invention that primers that copy an entire gene sequence 

need not be utilized. Rather, the copied locus of interest is preferably only a small part of the total 
gene or a small part of a non-coding region of DNA. There is no advantage to sequencing the entire 
gene as this can increase cost and delay results. Sequencing only the desired'bases or loci of 
interest maximizes the overall efficiency of the method because it allows for the sequence of the 

30 maximum number of loci of interest to be determined in the fastest amount of time and with 
minimal cost. 

Because a large number of sequences can be analyzed together, the method of the invention 
is especially amenable to the large-scale screening of a number of loci of interest. 

Any number of loci of interest can be analyzed and processed, especially at the same time, 
35 using the method of the invention. The sample(s) can be analyzed to determine the sequence at one 
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locus of interest or at multiple loci of interest at the same time. The loci of interest can be present 

« 

on a single chromosome or on multiple chromosomes. 

Alternatively, 2, 3, 4, 5, 6, 7, 8, 9, 10-20, 20-25, 25-30, 30-35, 35-40, 40-45, 45-50, 50-100, 
100-250, 250-500, 500-1,000, 1,000-2,000, 2,000-3, 000, 3,000-5,000, 5,000-10,000, 
5 10,000-50,000 or more than 50,000 loci of interest can be analyzed at the same time when a global 
genetic screening is desired. Such a global genetic screening might be desired when using the 
method of the invention to provide a genetic fingerprint to identify an individual or for SNP 
genotyping. 

The locus of interest to be copied can be within a coding sequence or outside of a coding 
10 sequence. Preferably, one or more loci of interest that are to be copied are within a gene. In a 

preferred embodiment, the template DNA that is copied is a locus or loci of interest that is within a 
genomic coding sequence, either intron or exon. In a highly preferred embodiment, exon DNA 
sequences are copied. The loci of interest can be sites where mutations are known to cause disease 
or predispose to a disease state. The loci of interest can be sites of single nucleotide 
15 polymorphisms. Alternatively, the loci of interest that are to be copied can be outside of the coding 
sequence, for example, in a transcriptional regulatory region, and especially a promoter, enhancer, 
or repressor sequence. 

Method for Determining the Sequence of a Locus of Interest 

Any method that provides information on the sequence of a nucleic acid can be used 

20 including but not limited to allele specific PCR, PCR, gel electrophoresis, ELISA, mass 

spectrometry, MALDI-TOF mass spectrometry hybridization, primer extension, fluorescence 
detection, fluorescence resonance energy transfer (FRET), fluorescence polarization, DNA 
sequencing, Sanger dideoxy sequencing, DNA sequencing gels, capillary electrophoresis on an 
automated DNA sequencing machine, microchannel electrophoresis, microarray, southern blot, slot 

25 blot, dot blot, single primer linear nucleic acid amplification, as described in U.S. Patent No. 

6,251,639, SNP-IT, GeneChips, HuSNP, BeadArray, TaqMan assay, Invader assay, MassExtend, or 
MassCleave™ (hMC) method. 

The preferred method of determining the sequence has previously been described in U.S. 
Application No. 10/093,618, filed on March 11, 2002, hereby incorporated by reference in its 

30 entirety. 

L Primer Design 

Published sequences, including consensus sequences, can be used to design or select 
primers for use in amplification of template DNA. The selection of sequences to be used for the 
construction of primers that flank a locus of interest can be made by examination of the sequence of 

35 the loci of interest, or immediately thereto. The recently published sequence of the human genome 
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provides a source of useful consensus sequence information from which to design primers to flank a 
desired human gene locus of interest. 

By "flanking" a locus of interest is meant that the sequences of the primers are such that at 
least a portion of the 3 5 region of one primer is complementary to the antisense strand of the 
template DNA and upstream from the locus of interest site (forward primer), and at least a portion 
of the 3 9 region of the other primer iscomplementary to the sense strand of the template DNA and 
downstream of the locus of interest (reverse primer). A "primer pair" is intended a pair of forward 
and reverse primers. Both primers of a primer pair anneal in a manner that allows extension of the 
primers, such that the extension results in amplifying the template DNA in the region of the locus of 
interest. 

Primers can be prepared by a variety of methods including but not limited to cloning of 
appropriate sequences and direct chemical synthesis using methods well known in the art (Narang 
et al, Methods Enzymol 68:90 (1979); Brown et a!., Methods Enzymol 55:109 (1979)). Primers 
can also be obtained from commercial sources such as Operon Technologies, Amersham Pharmacia 
Biotech, Sigma, and Life Technologies. The primers can have an identical melting temperature. 
The lengths of the primers can be extended or shortened at the 5' end or the 3' end to produce 
primers with desired melting temperatures. In a preferred embodiment, one of the primers of the 
prime pair is longer than the other primer. In a preferred embodiment, the 3' annealing lengths of 
the primers, within a primer pair, differ. Also, the annealing position of each primer pair can be 
designed such that the sequence and length of the primer pairs yield the desired melting 
temperature. The simplest equation for determining the melting temperature of primers smaller 
than 25 base pairs is the Wallace Rule (Td = 2(A+T) + 4(G+C)). Computer programs can also be 
used to design primers, including but not limited to Array Designer Software (Arrayit Inc.), 
Oligonucleotide Probe Sequence Design Software for Genetic Analysis (Olympus Optical Co.), 
NetPrimer, and DNAsis from Hitachi Software Engineering. The TM (melting or annealing 
temperature) of each primer is calculated using software programs such as Net Primer (free web 
based program at http://premierbiosoft.com/netprimer/netprlaimch/netprlaunc internet 
address as of April 17, 2002). 

In another embodiment, the annealing temperature of the primers can be recalculated and 
increased after any cycle of amplification, including but not limited to cycle 1, 2, 3, 4, 5, cycles 
6-10, cycles 10-15, cycles 15-20, cycles 20-25, cycles 25-30, cycles 30-35, or cycles 35-40. After 
the initial cycles of amplification, the 5' half of the primers is incorporated into the products from 
each loci of interest, thus the TM can be recalculated based on both the sequences of the 5' half and 
the 3' half of each primer. 
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For example, in FIG. IB, the first cycle of amplification is performed at about the melting 
temperature of the 3 9 region, which anneals to the template DNA, of the second primer (region "c"), 
which is 13 bases. After the first cycle, the annealing temperature can be raised to TM2, which is 
about the melting temperature of the 3 9 region, which anneals to the template DNA, of the first 
5 primer, which is depicted as region "b." The second primer cannot bind to the original template 
DNA because it only anneals to 13 bases in the original DNA template, and TM2 is about the 
melting temperature of approximately 20 bases, which is the 3' annealing region of the first primer 
(FIG. 1C). However, the first primer can bind to the DNA that was copied in the first cycle of the 
. reaction. In the third cycle, the annealing temperature is raised to TM3, which is about the melting 
10 temperature of the entire sequence of the second primer, which is depicted as regions "c" and "d." 
The DNA template produced from the second cycle of PCR contains both regions c 1 and d', and 
therefore, the second primer can anneal and extend at TM3 (FIG. ID). The remaining cycles are 
performed at TM3. The entire sequence of the first primer (a + b') can anneal to the template from 
the third cycle of PCR, and extend (FIG. IE). Increasing the annealing temperature will decrease 
1 5 non-specific binding and increase the specificity of the reaction, which is especially useful if 
amplifying a locus of interest from human genomic DNA, which is about 3x1 0 9 base pairs long. 

As used herein, the term "about" with regard to annealing temperatures is used to 
encompass temperatures within 10 degrees celcius of the stated temperatures. 

In one embodiment, one primer pair is used for each locus of interest. However, multiple 
20 primer pairs can be used for each locus of interest. 

In one embodiment, primers are designed such that one or both primers of the primer pair 
contain sequence in the 5' region for one or more restriction endonucleases (restriction enzyme). 

As used herein, with regard to the position at which restriction enzymes digest DNA, the 
"sense" strand is the strand reading 5 9 to 3* in the direction in which the restriction enzyme cuts. 
25 For example, BsmF I recognizes the following sequences: 

5' GGGAC(N), 0 3' 5' (N)i 4 GTCCC 3' 

3' CCCTG(N) l4 5' 3'(N) l0 CAGGG 5' 

The sense strand is the strand containing the "GGGAC" sequence as it reads 5' to 3' in the 
direction that the restriction enzyme cuts. 
30 As used herein, with regard to the position at which restriction enzymes digest DNA, the 

"antisense" strand is the strand reading 3* to 5* in the direction in which the restriction enzyme cuts. 

In another embodiment, one of the primers in a primer pair is designed such that it contains 
a restriction enzyme recognition site for a restriction enzyme that cuts "n" nucleotides away from 
the recognition site, and produces a recessed 3' end and a 5 s overhang that contains the locus of 
35 interest (herein referred to as a "second primer"). "N" is a distance from the recognition site to the 
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site of the cut by the restriction enzyme. In other words, the second primer of a primer pair contains 
a recognition site for a restriction enzyme that does not cut DNA at the recogiition site but cuts "n" 
nucleotides away from the recognition site. For example, if the recognition sequence is for the 
restriction enzyme BceA I, the enzyme will cut ten (10) nucleotides from the recognition site on the 

5 sense strand, and twelve (12) nucleotides away from the recognition site on the antisense strand. 

The 3 ' region and preferably, the 3 9 half, of the primers is designed to anneal to a sequence 
that flanks the loci of interest (FIG. 1 A). The second primer can anneal any distance from the 
locus of interest provided that digestion with the restriction enzyme that recognizes the restriction 
enzyme recognition site on this primer generates a 5' overhang that contains the locus of interest. 

10 The 5' overhangs can be of any size, including but not limited to 1, 2, 3, 4, 5, 6, 7, 8, and more than 
8 bases. 

In a preferred embodiment, the 3' end of the primer that anneals closer to the locus of 
interest (second primer) can anneal 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, or more than 14 bases 
from the locus of interest or at the locus of interest. 

15 In a preferred embodiment, the second primer is designed to anneal closer to the locus of 

interest than the other primer of a primer pair (the other primer is herein referred to as a "first 
primer"). The second primer can be a forward or reverse primer and the first primer can be a 
reverse or forward primer, respectively. Whether the first or second primer should be the forward 
or reverse primer can be determined by which design will provide better sequencing results. 

20 For example, the primer that anneals closer to the locus of interest can contain a recognition 

site for the restriction enzyme BsmF I, which cuts ten (10) nucleotides from the recognition site on 
the sense strand, and fourteen (14) nucleotides from the recognition site on the antisense strand. In 
this case, the primer can be designed so that the restriction eirzyme recognition site is 13 bases, 12 
bases, 10 bases or 1 1 bases from the locus of interest. If the recognition site is 13 bases from the 

25 locus of interest, digestion with BsmF I will generate a 5 ' overhang (RXXX), wherein the locus of 
interest (R) is the first nucleotide in the overhang (reading 3' to 5'), and X is any nucleotide. If the 
recognition site is 12 bases from the locus of interest, digestion with BsmF I will generate a 5' 
overhang (XRXX), wherein the locus of interest (R) is the second nucleotide in the overhang 
(reading 3' to 5'). If the recognition site is 1 1 bases from the locus of interest, digestion with BsmF 

30 I will generate a 5' overhang (XXRX), wherein the locus of interest (R) is the third nucleotide in the 
overhang (reading 3 ' to 5'). The distance between the restriction enzyme recognition site and the 
locus of interest should be designed so that digestion with the restriction enzyme generates a 5' 
overhang, which contains the locus of interest. The effective distance between the recognition site 
and the locus of interest will vary depending on the choice of restriction enzyme. 
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In another embodiment, the primer that anneals closer to the locus of interest site, relative 
to the other primer, can be designed so that the restriction enzyme that generates the 5' overhang, 
which contains the locus of interest, will see the same sequence at the cut site, independent of the 
nucleotide at the locus of interest site. For example, if the primer that anneals closer to the locus of 
5 interest is designed so that the recognition site for the restriction enzyme BsmF I (5 5 GGGAC 3 *) is 
thirteen bases from the locus of interest, the restriction enzyme will cut the antisense strand one 
base from the locus of interest. The nucleotide at the locus of interest is adjacent to the cut site, and 
may vary from DNA molecule to DNA molecule. If it is desired that the nucleotides adjacent to the 
cut site be identical, the primer can be designed so that the restriction enzyme recognition site for 

1 0 BsmF I is twelve bases away from the locus of interest site. Digestion with BsmF I will generate a 
5* overhang, wherein the locus of interest site is in the second position of the overhang (reading 3* 
to 5*) and is no longer adjacent to the cut site. Designing the primer so that the restriction enzyme 
recognition site is twelve (12) bases from the locus of interest site allows the nucleotides adjacent to 
the cut site to be the same, independent of the nucleotide at the locus of interest Also, primers that 

1 5 have been designed so that the restriction enzyme recognition site, BsmF I, is eleven (1 1) or ten 
(10) bases from the locus of interest site will allow the nucleotides adjacent to the cut site to be the 
same, independent of the nucleotide at the locus of interest. Similar strategies of primer design can 
be, employed with other restriction enzymes so that the nucleotides adjacent to the cut site will be 
the same, independent of the nucleotide at the loci of interest. 

20 The 3 * end of the first primer (either the forward or the reverse) can be designed to anneal 

at a chosen distance from the locus of interest. Preferably, for example, this distance is between 1- 
10, 10-25, 25-50, 50-75, 75-100, 100-150, 150-200, 200-250, 250-300, 300-350, 350-400, 400-450, 
450-500, 500-550, 550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 
950-1000 and greater than 1000 bases away from the locus of interest. The annealing sites of the 

25 first primers are chosen such that each successive upstream primer is further and further away from 
its respective downstream primer. 

For example, if at locus of interest 1 the V ends of the first and second primers are Z bases 
apart, then at locus of interest 2, the 3* ends of the upstream and downstream primers are Z + K 
bases apart, where K = 1, 2, 3, 4, 5-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 

30 90-100, 100-200, 200-300, 300-400, 400-500, 500-600, 600-700, 700-800, fcOO-900, 900-1000, or 
greater than 1000 bases (FIG 2). The purpose of making the first primers further and further apart 
from their respective second primers is so that the PCR products of all the loci of interest differ in 
size and can be separated, e.g., on a sequencing gel. This allows for multiplexing by pooling the 
PCR products in later steps. 
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In one embodiment, the 5' region of the first or second primer can have a recognition site 
for any type of restriction enzyme. In a preferred embodiment, the 5' region of the first and/or 
second primer has at least one restriction enzyme recognition site that is different from the 
restriction enzyme recognition site that is used to generate the 5' overhang, which contains the 
5 locus of interest. 

In one embodiment, the 5' region of the first primer can have a recognition site for any type < - 
of restriction enzyme. In a preferred embodiment, the first primer has at least one restriction 
enzyme recognition site that is different from the restriction enzyme recognition site in the second 
primer. In another preferred embodiment, the first primer anneals further away from the locus of 

1 0 interest than the second primer. 

In a preferred embodiment, the second primer contains a restriction enzyme recognition 
sequence for a Type IIS restriction enzyme including but not limited to BceA I and BsmF I, which 
produce a two base 5' overhang and a four base 5' overhang, respectively. Restriction enzymes that 
are Type IIS are preferred because they recognize asymmetric base sequences (not palindromic like 

15 the orthodox Type II enzymes). Type IIS restriction enzymes cleave DNA at a specified position 
that is outside of the recognition site, typically up to 20 base pairs outside of the recognition site. 
These properties make Type IIS restriction enzymes, and the recognition sites thereof, especially 
useful in the method of the invention. Preferably, the Type IIS restriction enzymes used in this 
method leave a 5' overhang and a recessed 3 \ 

20 A wide variety of Type US restriction enzymes are known and such enzymes have been 

isolated from bacteria, phage, archeabacteria and viruses of eukaryotic algae and are commercially 
available (Promega, Madison WI; New England Biolabs, Beverly, MA; Szybalski W. et aL, Gene 
100: 13-26, 1991). Examples of Type IIS restriction enzymes that would be useful in the method of 
the invention include, but are not limited to enzymes such as those listed in Table I. 



Enzyme-Source 


Recognition/Cleavage Site 


Supplier 


Alwl - Acinetobacter Iwqffii 


GGATC(4/5) 


NE Biolabs 


Alw26 1- Acinetobacter twoffi 


GTCTC(l/5) 


Promega 


Bbsl - Bacillus laterosporus 


GAAGAC(2/6) 


NE Biolabs 


Bbvl - Bacillus brevis 


GCAGC(8/12) 


NE Biolabs 


BceA I - Bacillus cereus 1315 


IACGGC(12/14) 


NE Biolabs 


Bmrl - Bacillus megaterium 


CTGGG(5/4) 


NE Biolabs 


Bsa I - Bacillus stearothermophilus 6-55 


GGTCTC(l/5) 


NE Biolabs 


Bst71 1 ^Bacillus stearothermophilus 71 


GCAGC(8/12) 


Promega 


BsmA I - Bacillus stearothermophilus A664 


GTCTC(l/5) 


NE Biolabs 


BsmB I - Bacillus stearothermophilus B61 


CGTCTC(l/5) 


NE Biolabs 



51 



WO 2004/078994 



PCT7US2004/006337 



BsmF I - Bacillus stearothermophilus F 


GGGAC(10/14) 


NE Biolabs 


BspMI - Bacillus species M 


ACCTGC(4/8) 


NEBiolabs 


Ear I - Enterobacter aerogenes 


CTCTTC(l/4) 


NE Biolabs 


Faul - Flavobacterium aquatile 


CCCGC(4/6) 


NE Biolabs 


Fok I - Flavobacterium okeonokoites 


GGATG(9/13) 


NEBiolabs 


Hgal - Haemophilus gallinarum 


GACGC(5/10) 


NE Biolabs 


Pie I - Pseudomonas lemoignei 


GAGTC(4/5) 


NE Biolabs 


Sap I - Saccharopolyspora species 


GCTCTTC(l/4) 


NE Biolabs 


SfaN I - Streptococcus faecalis ND547 


GCATC(5/9) 


NE Biolabs 


Sthl32 1 - Streptococcus thermophilus ST132 


CCCG(4/8) 


No commercial 
supplier (Gene 
1*95:201-206 
(1997)) 



In one embodiment, a primer pair has sequence at the 5' region of each of the primers that 
provides a restriction enzyme recognition site that is unique for one restriction enzyme. 

In another embodiment, a primer pair has sequence at the 5 9 region of each of the primers 
5 that provide a restriction site that is recognized by more than one restriction enzyme, and especially 
for more than one Type US restriction enzyme. For example, certain consensus sequences can be 
recognized by more than one enzyme. For example, Bsgl, Eco571 and Bpml all recognize the 
consensus (G/C)TGnAG and cleave 16 by away on the antisense strand and 14 by away on the 
sense strand. A primer that provides such a consensus sequence would result in a product that has a 
1 0 site that can be recognized by any of the restriction enzymes Bsgl, Eco571 and Bpml. 

Other restriction enzymes that cut DNA at a distance from the recognition site, and produce 
a recessed 3 5 end and a 5' overhang include Type in restriction enzymes. 

For example, the restriction enzyme EcoP15I recognizes the sequence 5' CAGCAG 3' and 
cleaves 25 bases downstream on the sense strand and 27 bases on the antisense strand. It will be 
15 farther appreciated by a person of ordinary skill in the art that new restrictioji enzymes are 
continually being discovered and can readily be adopted for use in the subject invention. 

In another embodiment, the second primer can contain a portion of the recognition 
sequence for a restriction enzyme, wherein the full recognition site for the restriction enzyme is 
generated upon amplification of the template DNA such that digestion with the restriction enzyme 
20 generates a 5' overhang containing the locus of interest. For example, the recognition site for BsmF 
I is 5 ' GGGACNio* 3 ' . The 3 * region, which anneals to the template DNA, of the second primer 
can end with the nucleotides "GGG," which do not have to be complementary with the template 
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DNA. If the 3' annealing region is about 10-20 bases, even if the last three bases do not anneal, the 
primer will extend and, generate a BsmF I site. 

Second primer: 5 9 GGAAATTCCATGATGCGTGGG-> 
Template DNA 3' CCTTTAAGGTACTACGCAN1N2N3TG 5' 
5 5' GGAAATTCCATGATGCCTN r N2'N 3 «AC 3' 

The second primer can be designed to anneal to the template DNA, wherein the next two 
bases of the template.DNA are thymidine and guanine, such that an adenosine and cytosine are 
incorporated into the primer forming a recognition site for BsmF I, 5' GGGACNio* 3*. The second 
primer can be designed to anneal in such a manner that digestion with BsmF I generates a 5' 
1 0 overhang containing the locus of interest 

In another embodiment, the second primer can contain an entire or full recognition site for a 
restriction enzyme or a portion of a recognition site, which generates a full recognition site upon 
primer-dependent replication of the template DNA such that digestion with a restriction enzyme 
that cuts at the recognition site and generates a 5' overhang that contains the locus of interest. For 
15 example, the restriction enzyme BsaJ I binds the following recognition site: 5* C^CN^GG 3'. 
The second primer can be designed such that the 3' region, which anneals to the template DNA of 
the primer ends with "CC", the SNP of interest is represented by "Ni", and the template sequence 
downstream of the SNP is "N 2 GG." 

Second primer: 5' GGAAATTCCATGATGCGTACC— > 
20 Template DNA 3' CCTTTAAGGTACTACGCATGGNiN 2 CC 5' 

5' GGAAATTCCATGATGCCTACCN r N 2 .GG 3' 
After digestion with BsaJ I, a 5' overhang of the following sequence would be generated: 
5'C 3' 
3'GGNiN 2 CC5' 

25 If the nucleotide guanine is not reported at the locus of interest, the 3 ' recessed end can be 

filled in with unlabeled cytosine, which is complementary to the first nucleotide in the overhang. 
After removing the excess cytosine, labeled ddNTPs can be used to fill in the next nucleotide, Ni, 
which represents the locus of interest. Other restriction enzymes can be used including but not 
limited to BssK I (5' ^CCNGG 3*), Dde 1(5' C*TNAG 3'), EcoN I (5' CCTNN i NNNAGG 3'), 

30 Fnu4H I (5* GC'NGC 3'), Hinf I (5' G i ANTC 3') PflF I (5' GACN'NNGTC 3'), Sau96 1(5' 
G l GNCC 3'), ScrF I (5' CC ; NGG 3'), and Tthl 1 1 1 (5' GACN^NNGTC 3'). 

It is not necessary that the 3 * region, which anneals to the template DNA, of the second 
primer be 100% complementary to the template DNA. For example, the last 1, 2, or 3 nucleotides 
of the 3' end of the second primer can be mismatches with the template DNA. The region of the 

35 primer that anneals to the template DNA will target the primer, and allow the primer to extend. 
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Even if the last two nucleotides are not complementary to the template DNA, the primer will extend 
and generate a restriction enzyme recognition site. For example, the last two nucleotides in the 
second primer are "CC " The second primer anneals to the template DNA, and allows extension 
even if "CC" is not complementary to the nucleotides Na, and Nb, on the template DNA. 
5 Second primer: 5' GGAAATTCCATGATGCGTACC— ► 

Template DNA 3' CCTTTAAGGTACTACGCATN a »N b »Ni.N r CC 5' 
5' GGAAATTCCATGATGCCTANaNbNiNjGG 3' 
After digestion with BsaJ I, a 5* overhang of the following sequence would be generated: 
5'C . 3' 
10 3'GGN,N 2 CC5' 

If the nucleotide guanine is not reported at the locus of interest, the 5' overhang can be 
filled in with unlabeled cytosine. The excess cytosine can be rinsed away, and filled in with labeled 
ddNTPs. The first nucleotide incorporated (Ni') corresponds to the locus of interest. If guanine is 
reported at the locus of interest, the loci of interest can be filled in with unlabeled cytosine and a 
15 nucleotide downstream of the locus of interest can be detected. For example, assume N 2 is adenine. 
If the locus of interest is guanine, unlabeled cytosine can be used in the fill in reaction. After 
removing the cytosine, a fill in reaction with labeled thymidine can be used. The labeled thymidine 
will be incorporated only if the locus of interest was a guanine. Thus, the sequence of the locus of 
interest can be determined by detecting a nucleotide downstream of the locus of interest. 
20 In another embodiment, the first and second primers contain a portion of a recognition 

sequence for a restriction enzyme, wherein the full recognition site for the restriction enzyme is 
generated upon amplification of the template DNA such that digestion with the restriction enzyme 
generates a 5' overhang containing the locus of interest. The recognition site for any restriction 
enayme that contains one or more than one variable nucleotide can be generated including but not 
25 limited to the restriction enzymes BssK I (5' 4 CCNGG 3'), Dde I (5'C l TNAG 3'), Econ I 
(5 , CCTNN i NNNAGG 3'), Fnu4H I (5'GC'NGC 3'), Hinf I (S'G'ANTC 3*), PflF I (5' 
GACN'NNGTC 3'), Sau96 1 (5' G A GNCC 3'), ScrF I (5' CC'NGG 3'), and Tthl 11 1 (5* 
GACN 4 NNGTC 3'). 

In a preferred embodiment, the 3' regions of the first and second primers contain the partial 
30 sequence for a restriction enzyme, wherein the partial sequence contains 1, 2, 3, 4 or more than 4 
mismatches with the template DNA; these mismatches create the restriction enzyme recognition 
site. The number of mismatches that can be tolerated at the 3* end depends on the length of the 
primer. For example, if the locus of interest is represented by Ni, a first primer can be designed to 
be complementary to the template DNA, depicted below as region "a." The 3' region of the first 
35 primer ends with "CC," which is not complementary to the template DNA. The second primer is 

54 



WO 2004/078994 



PCT/US2004/006337 



designed to be complementary to the template DNA, which is depicted below as region "b' The 
3* region of the second primer ends with "CC," which is not complementary to the template DNA. 

First primer 5' a CO» 

Template DNA 3' a' AANyNrTT V 5' 

5' a TTNiN 2 AA b 3' 

<— CC b' 5 ' Second primer 

After one round of amplification the following products would be generated: 
5' _a__CCNiN 2 AA b 3' 

and 

5' b CCN 2 .Ni-AA a* 3 '. 

In cycle two, the primers can anneal to the templates that were generated from the first 
cycle of PCR: 

5' a CCNiN 2 AA b 3' 

<-CC b' 5' 
<-CC a 5' 

5' b' C CN2N1AA a' 3 ' 

After cycle two of PCR, the following products would be generated: 
5' a CCN^GG b 3 ' 

3' a' GGN,N 2 CC b' 5' 

The restriction enzyme recognition site for BsaJ I is generated, and after digestion with 
BsaJ I, a 5' overhang containing the locus of interest is created. The locus of interest can be 
detected as described in detail below. 

In another embodiment, a primer pair has sequence at the 5' region of each of the primers 
that provides two or more restriction sites that are recognized by two or more restriction enzymes. 

In a most preferred embodiment, a primer pair has different restriction enzyme recognition 
sites at the 5' regions, especially 5' ends, such that a different restriction enzyme is required to 
cleave away any undesired sequences. For example, the first primer for locus of interest "A" can 
contain sequence recognized by a restriction enzyme, "X," which can be any type of restriction 
enzyme, and the second primer for locus of interest "A," which anneals closer to the locus of 
interest, can contain sequence for a restriction enzyme, "Y," which is a Type IIS restriction enzyme 
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that cuts "n" nucleotides away and leaves a 5'overhang and a recessed 3' end. The 5' overhang 
contains the locus of interest. After binding the amplified DNA to streptavidin coated wells, one 
can digest with enzyme "Y," rinse, then fill in with labeled nucleotides and rinse, and then digest 
with restriction enzyme "X," which will release the DNA fragment containing the locus of interest 
5 from the solid matrix. The locus of interest can be analyzed by detecting the- labeled nucleotide that 
was "filled in" at the locus of interest, e.g. SNP site. 

In another embodiment, the second primers for the different loci of interest that are being 
amplified according to the invention contain recognition sequence in the 5' regions for the same 
restriction enzyme and likewise all the first primers also contain the same restriction enzyme 

10 recognition site, which is a different enzyme from the enzyme that recognizes the second primers. 

In another embodiment, the second primers for the multiple loci of interest that are being 
amplified according to the invention contain restriction enzyme recognition sequences in the 5' 
regions for different restriction enzymes. 

In another embodiment, the first primers for the multiple loci of interest that are being 

1 5 amplified according to the invention contain restriction enzyme recognition sequences in the 5 9 
regions for different restriction enzymes. Multiple restriction enzyme sequences provide an 
opportunity to influence the order in which pooled loci of interest are released from the solid 
support. For example, if 50 loci of interest are amplified, the first primers can have a tag at the 
extreme 5' end to aid in purification and a restriction enzyme recognition site, and the second 

20 primers can contain a recognition site for a type IIS restriction enzyme. For example, several of the 
first primers can have a restriction enzyme recognition site for EcoR I, other first primers can have 
a recognition site for Pst I, and still other first primers can have a recognition site for BamH I. 
After amplification, the loci of interest can be bound to a solid support with the aid of the tag on the 
first primers. By performing the restriction digests one restriction enzyme at a time, one can 

25 serially release the amplified loci of interest. If the first digest is performed with EcoR I, the loci of 
interest amplified with the first primers containing the recognition site for EcoR I will be released, 
and collected while the other loci of interest remain bound to the solid support. The amplified loci 
of interest can be selectively released from the solid support by digesting with one restriction 
enzyme at a time. The use of different restriction enzyme recognition sites in the first primers 

30 allows a larger number of loci of interest to be amplified in a single reaction tube. 

In a preferred embodiment, any region 5' of the restriction enzyme digestion site of each 
primer can be modified with a functional group that provides for fragment manipulation, 
processing, identification, and/or purification. Examples of such functional groups, or tags, include 
but are not limited to biotin, derivatives of biotin, carbohydrates, haptens, dyes, radioactive 

35 molecules, antibodies, and fragments of antibodies, peptides, and immunogenic molecules. 
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In another embodiment, the template DNA can be replicated once, without being amplified 
beyond a single round of replication. This is useful when there is a large amount of the DNA 
available for analysis such that a large number of copies of the loci of interest are already present in 
the sample, and further copies are not needed. In this embodiment, the primers are preferably 
5 designed to contain a "hairpin" structure in the 5' region, such that the sequence doubles back and 
anneals to a sequence internal to itself in a complementary manner. When the template DNA is 
replicated only once, the DNA sequence comprising the recognition site would be single-stranded if 
not for the "hairpin" structure. However, in the presence of the hairpin structure, that region is 
effectively double stranded, thus providing a double stranded substrate for activity by restriction 
10 enzymes. 

To the extent that the reaction conditions are compatible, all the primer pairs to analyze a 
locus or, loci of interest of DNA can be mixed together for use in the method of the invention. In a 
preferred embodiment, all primer pairs are mixed with the template DNA in a single reaction vessel. 
Such a reaction vessel can be, for example, a reaction tube, or a well of a microliter plate. 

15 Alternatively, to avoid competition for nucleotides and to minimize primer dimers and 

difficulties with annealing temperatures for primers, each locus of interest or small groups of loci of 
interest can be amplified in separate reaction tubes or wells, and the products later pooled if desired. 
For example, the separate reactions can be pooled into a single reaction vessel before digestion with 
the restriction enzyme that generates a 5' overhang, which contains the locus of interest or SNP site, 

20 and a 3' recessed end. Preferably, the primers of each primer pair are provided in equimolar 

amounts. Also, especially preferably, each of the different primer pairs is provided in equimolar 
amounts relative to the other pairs that are being used. 

In another embodiment, combinations of primer pairs that allow efficient amplification of 
their respective loci of interest can be used (see e.g. FIG. 2). Such combinations can be determined 

25 prior to use in the method of the invention. Multi-well plates and PCR machines can be used to 
select primer pairs that work efficiently with one another. For example, gradient PCR machines, 
such as the Eppendorf Mastercycler® gradient PCR machine, can be used to select the optimal 
annealing temperature for each primer pair. Primer pairs that have similar properties can be used 
together in a single reaction tube. 

30 In another embodiment, a multi-sample container including but not limited to a 96-well or 

more plate can be used to amplify a single locus of interest with the same primer pairs from 
multiple template DNA samples with optimal PCR conditions for that locus of interest. 
Alternatively, a separate multi-sample container can be used for amplification of each locus of 
interest and the products for each template DNA sample later pooled. For example, gene A from 96 

35 different DNA samples can be amplified in microtiter plate 1 , gene B from 96 different DNA 
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samples can be amplified in microtiter plate 2, etc., and then the amplification products can be 
pooled. 

The result of amplifying multiple loci of interest is a preparation that contains 
representative PCR products having the sequence of each locus of interest. For example, if DNA 
5 from only one individual is used as the template DNA and if hundreds of disease-related loci of 
interest were amplified from the template DNA, the amplified DNA would be a mixture of small, 
PCR products from each of the loci of interest. Such a preparation could be further analyzed at that 
time to determine the sequence at each locus of interest or at only some loci of interest. 
Additionally, the preparation could be stored in a manner that preserves the DNA and can be 

10 analyzed at a later time. Information contained in the amplified DNA can be revealed by any 

suitable method including but not limited to fluorescence detection, sequencing, gel electrophoresis, 
and mass spectrometry (see "Detection of Incorporated Nucleotide" section below). 
II. Amplification of Loci of Interest 

The template DNA can be amplified using any suitable method known in the art including 

1 5 but not limited to PCR (polymerase chain reaction), 3 SR (self-sustained sequence reaction), LCR 
(ligase chain reaction), RACE-PCR (rapid amplification of cDNA ends), PLCR (a combination of 
polymerase chain reaction and ligase chain reaction), Q-beta phage amplification (Shah et at, J. 
Medical Micro. 33: 1435-41 (1995)), SDA (strand displacement amplification), SOE-PCR (splice 
overlap extension PCR), and the like. These methods can be used to design variations of the 

20 releasable primer mediated cyclic amplification reaction explicitly described in this application. In 

♦ 

the most preferred embodiment, the template DNA is amplified using PCR (PCR: A Practical 
Approach, M. J. McPherson, et al., IRL Press (1991); PCR Protocols: A Guide to Methods and 
Applications, Innis, et al., Academic Press (1990); and PCR Technology: Principals and 
Applications of DNA Amplification, H. A. Erlich, Stockton Press (1989)). PCR is also described in 

25 numerous U.S. patents, including U.S. Pat. Nos. 4,683,195; 4,683,202; 4,800,159; 4,965,188; 
4,889,818; 5,075,216; 5,079,352; 5,104,792, 5,023,171; 5,091,310; and 5,066,5 84. 

The components of a typical PCR reaction include but are not limited to a template DNA, 
primers, a reaction buffer (dependent on choice of polymerase), dNTPs (dATP, dTTP, dGTP, and 
dCTP) and a DNA polymerase. Suitable PCR primers can be designed and prepared as discussed 

30 above (see "Primer Design" section above). Briefly, the reaction is heated to 95°C for 2 min. to 
separate the strands of the template DNA, the reaction is cooled to an appropriate temperature 
(determined by calculating the annealing temperature of designed primers) to allow primers to 
anneal to the template DNA, and heated to 72°C foT two minutes to allow extension. 

In a preferred embodiment, the annealing temperature is increased in each of the first three 

35 cycles of amplification to reduce non-specific amplification. See also Example 1, below. The TM1 
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of the first cycle of PCR is about the melting temperature of the 3* region of the second primer that 
anneals to the template DNA. The annealing temperature can be raised in cycles 2-10, preferably in 
cycle 2, to TM2, which is about the melting temperature of the 3 1 region, which anneals to the 
template DNA, of the first primer. If the annealing temperature is raised in cycle 2, the annealing 
5 temperature remains about the same until the next increase in annealing temperature. Finally, in 
any cycle subsequent to the cycle in which the annealing temperature was increased to TM2, 
preferably cycle 3, the annealing temperature is raised to TM3, which is about the melting 
temperature of the entire second primer. After the third cycle, the annealing temperature for the 
remaining cycles can be at about TM3 or can be further increased. In this example, the annealing 

10 temperature is increased in cycles 2 and 3. However, the annealing temperature can be increased 
from a low annealing temperature in cycle 1 to a high annealing temperature in cycle 2 without any 
further increases in temperature or the annealing temperature can progressively change from a low 
annealing temperature to a high annealing temperature in any number of incremental steps. For 
example, the annealing temperature can be changed in cycles 2, 3, 4, 5, 6, etc. 

15 After annealing, the temperature in each cycle is increased to an "extension" temperature to 

allow the primers to "extend" and then following extension the temperature in each cycle is 
increased to the denaturization temperature. For PCR products less than 500 base pairs in size, one 
can eliminate the extension step in each cycle and just have denaturization and annealing steps. A 
typical PCR reaction consists of 25-45 cycles of denaturation, annealing and extension as described 

20 above. However, as previously noted, one cycle of amplification (one copy) can be sufficient for 
practicing the invention. 

In another embodiment, multiple sets of primers wherein a primer set comprises a forward 
primer and a reverser primer, can be used to amplify the template DNA for 1-5, 5-10, 10-15, 15-20 
or more than 20 cycles, and then the amplified product is further amplified in a reaction with a 

25 single primer set or a subset of the multiple primer sets. In a preferred embodiment, a low 

concentration of each primer set is used to minimize primer-dimer formation. A low concentration 
of starting DNA can be amplified using multiple primer sets. Any number of primer sets can be 
used in the first amplification reaction including but not limiting to 1-10, 10-20, 20-30, 30-40, 
40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-150, 150-200, 200-250, 250-300, 300-350, 

30 350-400, 400-450, 450-500, 500-1000, and greater than 1000. In another embodiment, the 
amplified product is amplified in a second reaction with a single primer set. In another 
embodiment, the amplified product is further amplified with a subset of the multiple primer pairs 
including but not limited to 2-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 
100-150, 150-200, 200-250, and more than 250. 
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The multiple primer sets will amplify the loci of interest, such that a minimal amount of 
template DNA is not limiting for the number of loci that can be detected. For example, if template 
DNA is isolated from a single cell or the template DNA is obtained from a pregnant female, which 
comprises both maternal template DNA and fetal template DNA, low concentrations of each primer 
set can be used in a first amplification reaction to amplify the loci of interest. The low 
concentration of primers reduces the formation of primer-dimer and increases the probability that 
the primers will anneal to the template DNA and allow the polymerase to extend. The optimal 
number of cycles performed with the multiple primer sets is determined by the concentration of the 
primers. Following the first amplification reaction, additional primers can be added to further 
amplify the loci of interest. Additional amounts of each primer set can be afdded and further 
amplified in a single reaction. Alternatively, the amplified product can be further amplified using a 
single primer set in each reaction or a subset of the multiple primers sets. For example, if 150 
primer sets were used in the first amplification reaction, subsets of 10 primer sets can be used to 
further amplify the product from the first reaction. 

Any DNA polymerase that catalyzes primer extension can be used including but not limited 
to E. coli DNA polymerase, Klenow fragment of E. coli DNA polymerase 1, T7 DNA polymerase, 
T4 DNA polymerase, Taq polymerase, Pfii DNA polymerase, Vent DNA polymerase, 
bacteriophage 29, REDTaq™ Genomic DNA polymerase, or sequenase. Preferably, a thermostable 
DNA polymerase is used. A "hot start" PCR can also be performed wherein the reaction is heated 
to 95°C for two minutes prior to addition of the polymerase or the polymerase can be kept inactive 
until the first heating step in cycle 1 . "Hot start" PCR can be used to minimize nonspecific 
amplification. Any number of PCR cycles can be used to amplify the DNA, including but not 
limited to 2, 5, 10, 15, 20, 25, 30, 35, 40, or 45 cycles. In a most preferred embodiment, the number 
of PCR cycles performed is such that equimolar amounts of each loci of interest are produced, 
m. Purification of Amplified DNA 

Purification of the amplified DNA is not necessaiy for practicing the invention. However, 
in one embodiment, if purification is preferred, the 5' end of the primer (first or second primer) can 
be modified with a tag that facilitates purification of the PCR products. In a preferred embodiment, 
the first primer is modified with a tag that facilitates purification of the PCR products. The 
modification is preferably the same for all primers, although different modifications can be used if 
it is desired to separate the PCR products into different groups. 

The tag can be any chemical moiety including but not limited to a radioisotope, fluorescent 
reporter molecule, chemiluminescent reporter molecule, antibody, antibody fragment, hapten, 
biotin, derivative of biotin, photobiotin, iminobiotin, digoxigenin, avidin, enzyme, acridinium, 
sugar, enzyme, apoenzyme, homopolymeric oligonucleotide, hormone, ferromagnetic moiety, 
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paramagnetic moiety, diamagnetic moiety, phosphorescent moiety, luminescent moiety, 
electrochemiluminescent moiety, chromatic moiety, moiety having a detectable electron spin 
resonance, electrical capacitance, dielectric constant or electrical conductivity, or combinations 
thereof. 

5 As one example, the 5' ends of the primers can be biotinylated (Kandpal et al, Nucleic 

Acids Res. 78:1789-1795 (1990); Kaneoka et al, Biotechniques 70:30-34 (1991); Green et al, 

Nucleic Acids Res. 75:6163-6164 (1990)). The biotin provides an affinity feig that can be used to 

purify the copied DNA from the genomic DNA or any other DNA molecules that are not of interest. 

Biotinylated molecules can be purified using a streptavidin coated matrix as shown in FIG. IF, 
1 0 including but not limited to Streptawell, transparent, High-Bind plates from Roche Molecular 

Biochemicals (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 

Biochemicals Catalog). 

The PCR product of each locus of interest is placed into separate wells of a Streptavidin 

coated plate. Alternatively, the PCR products of the loci of interest can be pooled and placed into a 
15 streptavidin coated matrix, including but not limited to the Streptawell, transparent, High-Bind 

plates from Roche Molecular Biochemicals (catalog number 1 645 692, as listed in Roche 

Molecular Biochemicals, 2001 Biochemicals Catalog). 

The amplified DNA can also be separated from the template DNA using non-affinity 

methods known in the art, for example, by polyacrylamide gel electrophoresis using standard 
20 protocols. 

IV. Digestion of Amplified DNA 

The amplified DNA can be digested with a restriction enzyme that recognizes a sequence 

that had been provided on the first or second primer using standard protocols known within the art 

(FIGS. 6A-6D). Restriction enzyme digestions are performed using standard protocols well known 
25 within the art. The en2yme used depends on the restriction recognition site generated with the first 

or second primer. See "Primer Design" section, above, for details on restriction recognition sites 

generated on primers. 

Type IIS restriction enzymes are extremely useful in that they cut approximately 10-20 base 
pairs outside of the recognition site. Preferably, the Type IIS restriction enzymes used are those 

30 that generate a 5' overhang and a recessed V end, including but not limited to BceA I and BsmF I 
(see e.g. Table 1). In a most preferred embodiment, the second primer (either forward or reverse) 
contains a restriction enzyme recognition sequence for BsmF I or BceA I. The Type IIS restriction 
enzyme BsmF I recognizes the nucleic acid sequence GGGAC, and cuts 14 nucleotides from the 
recognition site on the antisense strand and 10 nucleotides from the recognition site on the sense 

35 strand. Digestion with BsmF I generates a 5' overhang of four (4) bases. 
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For example, if the second primer is designed so that after amplification the restriction 
enzyme recognition site is 13 bases from the locus of interest, then after digestion, the locus of 
interest is the first base in the 5' overhang (reading 3' to 5'), and the recessed 3 s end is one base 
from the locus of interest. The 3 9 recessed end can be filled in with a nucleotide that is 
5 complementary to the locus of interest. One base of the overhang can be filled in using 
dideoxynucleotides. However, 1, 2, 3, or 4 bases of the overhang can be filled in using 
deoxymicleotides or a mixture of dideoxynucleotides and deoxynucleotides. 

The restriction enzyme BsmF I cuts DNA ten(10) nucleotides from the recognition site on 
the sense strand and fourteen (14) nucleotides from the recognition site on the antisense strand. 

10 However, in a sequence dependent manner, the restriction enzyme BsmF I also cuts eleven (1 1) 
nucleotides from the recognition site on the sense strand and fifteen (15) nucleotides from the 
recognition site on the antisense strand. Thus, two populations of DNA molecules exist after 
digestion: DNA molecules cut at 10/14 and DNA molecules cut at 1 1/15. If the recognition site for 
BsmF I is 13 bases from the locus of interest in the amplified product, then DNA molecules cut at 

15 the 1 1/15 position will generate a 5' overhang that contains the locus of interest in the second 

position of the overhang (reading 3' to 5'). The 3' recessed end of the DNA molecules can be filled 
in with labeled nucleotides. For example, if labeled dideoxynucleotides are used, the 3' recessed 
end of the molecules cut at 1 1/15 would be filled in with one base, which corresponds to the base 
upstream from the locus of interest, and the 3' recessed end of molecules cttt at 10/14 would be 

20 filled in with one base, which corresponds to the locus of interest. The DNA molecules that have 
been cut at the 10/14 position and the DNA molecules that have been cut at the 1 1/15 position can 
be separated by size, and the incorporated nucleotides detected. This allows detection of both the 
nucleotide before the locus of interest, detection of the locus of interest, and potentially the three 
bases after the locus of interest. 

25 Alternatively, if the base upstream from the locus of interest and the locus of interest are 

different nucleotides, then the 3 ' recessed end of the molecules cut at 1 1/15 can be filled in with 
deoxynucleotide that is complementary to the upstream base. The remaining deoxynucleotide is 
washed away, and the locus of interest site can be filled in with either labeled deoxynucleotides, 
unlabeled deoxynucleotides, labeled dideoxynucleotides, or unlabeled dideoxynucleotides. After 

30 the fill in reaction, the nucleotide can be detected by any suitable method. Thus, after the first fill in 
reaction with dNTP, the 3* recessed end of the molecules cut at 10/14 and 11/15 is upstream from 
the locus of interest. The 3' recessed end can now be filled in one base, which corresponds to the 
locus of interest, two bases, three bases or four bases. 

The restriction enzyme BceA I recognizes the nucleic acid sequence ACGGC and cuts 12 

35 (twelve) nucleotides from the recognition site on the sense strand and 14 (fourteen) nucleotides 
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from the recognition site on the antisense strand. If the distance from the recognition site for BceA 
I on the second primer is designed to be thirteen (13) bases from the locus of interest (see FIGS. 
4A-4D), digestion with BceA I will generate a 5' overhang of two bases, which contains the locus 
of interest, and a recessed 3' end that is upstream from the locus of interest The locus of interest is 
5 the first nucleotide in the 5' overhang (reading 3' to 5'). 

Alternative cutting is also seen with the restriction enzyme BceA I, although at a much 
lower frequency than is seen with BsmF I. The restriction enzyme BceA I can cut thirteen (1 3) 
nucleotides from the recognition site on the sense strand and fifteen (15) nucleotides from the 
recognition site on the antisense strand. Thus, two populations of DNA molecules exist: DNA 

10 molecules cut at 12/14 and DNA molecules cut at 13/15. If the restriction enzyme recognition site 
is 13 bases from the locus of interest in the amplified product, DNA molecules cut at the 13/15 
position yield a 5' overhang, which contains the locus of interest in the second position of the 
overhang (reading 3' to 5'). Labeled dideoxynucleotides can be used to fill in the 3' recessed end 
of the DNA molecules. The DNA molecules cut at 13/15 will have the base upstream from the 

15 locus of interest filled in, and the DNA molecules cut at 12/14 will have the locus of interest site 
filled in. The DNA molecules cut at 13/15 and those cut at 12/14 can be separated by size, and the 
incorporated nucleotide detected. Thus, the alternative cutting can be used to obtain additional 
sequence information. 

Alternatively, if the two bases in the 5' overhang are different, the 3' recessed end of the 

20 DNA molecules, which were cut at 13/15, can be filled in with the deoxynucleotide complementary 
to the first base in the overhang, and excess deoxynucleotide washed away. After filling in, the 3' 
recessed end of the DNA molecules that were cut at 12/14 and the DNA molecules that were cut at 
13/15 are upstream from the locus of interest The 3' recessed ends can be filled with either labeled 
dideoxynucleotides, unlabeled dideoxynucleotides, labeled deoxynucleotides, or unlabeled 

25 deoxynucleotides. 

If the primers provide different restriction sites for certain of the loci of interest that were 
copied, all the necessary restriction enzymes can be added together to digest the copied DNA 
simultaneously. Alternatively, the different restriction digests can be made in sequence, for 
example, using one restriction enzyme at a time, so that only the product that is specific for that 

30 restriction enzyme is digested. 

Optimal restriction enzyme digestion conditions, including but not limited to the 
concentration of enzyme, temperature, buffer conditions, and the time of digestion can be optimized 
for each restriction enzyme. For example, the alternative cutting seen with the type IIS restriction 
enzyme BsmF I can be reduced, if desired, by performing the restriction enzyme digestion at lower 

35 temperatures including but not limited to 25-16°, 16-12°C, 12-8°C, 8-4°C, or 4-0°C. 
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V. Incorporation of Labeled Nucleotides 

Digestion with the restriction enzyme that recognizes the sequence on the second primer 
generates a recessed 3' end and a 5' overhang, which contains the locus of interest (FIG. 1G). The 
recessed 3' end can be filled in using the 5' overhang as a template in the presence of unlabeled or 
5 labeled nucleotides or a combination of both unlabeled and labeled nucleotides. The nucleotides 
can be labeled with any type of chemical group or moiety that allows for detection including but not 
limited to radioactive molecules, fluorescent molecules, antibodies, antibody fragments, haptens, 
carbohydrates, biotin, derivatives of biotin, phosphorescent moieties, luminescent moieties, 
electrochemiluminescent moieties, chromatic moieties, and moieties having a detectable electron 

10 spin resonance, electrical capacitance, dielectric constant or electrical conductivity. The 

nucleotides can be labeled with one or more than one type of chemical group or moiety. Each 
nucleotide can be labeled with the same chemical group or moiety. Alternatively, each different 
nucleotide can be labeled with a different chemical group or moiety. The labeled nucleotides can 
be dNTPs, ddNTPs, or a mixture of both dNTPs and ddNTPs. The unlabeled nucleotides can be 

15 dNTPs, ddNTPs or a mixture of both dNTPs and ddNTPs. 

Any combination of nucleotides can be used to incorporate nucleotides including but not 
limited to unlabeled deoxynucleotides, labeled deoxynucleotides, unlabeled dideoxynucleotides, 
labeled dideoxynucleotides, a mixture of labeled and unlabeled deoxynucleotides, a mixture of 
labeled and unlabeled dideoxynucleotides, a mixture of labeled deoxynucleotides and labeled 

20 dideoxynucleotides, a mixture of labeled deoxynucleotides and unlabeled dideoxynucleotides, a 
mixture of unlabeled deoxynucleotides and unlabeled dideoxynucleotides, a mixture of unlabeled 
deoxynucleotides and labeled dideoxynucleotides, dideoxynucleotide analogues, deoxynucleotide 
analogues, a mixture of dideoxynucleotide analogues and deoxynucleotide analogues, 
phosphorylated nucleoside analogues, 2 , -deoxynucleotide-5'-triphosphate, and modified 

25 2*-deoxynucleotide-5 '-triphosphate. 

For example, as shown in FIG. 1H, in the presence of a polymerase, the 3' recessed end 
can be filled in with fluorescent ddNTP using the 5' overhang as a template. The incorporated 
ddNTP can be detected using any suitable method including but not limited to fluorescence 
detection. 

30 All four nucleotides can be labeled with different fluorescent groups, which will allow one 

reaction to be performed in the presence of all four labeled nucleotides. Alternatively, four separate 
"fill in" reactions can be performed for each locus of interest; each of the four reactions will contain 
a different labeled nucleotide (e.g. ddATP*, ddTTP*, ddGTP*, or ddCTP*, where * indicates a 
labeled nucleotide). Each nucleotide can be labeled with different chemical groups or the same 

35 chemical groups. The labeled nucleotides can be dideoxynucleotides or deoxynucleotides. 
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In another embodiment, nucleotides can be labeled with fluorescent dyes including but not 
limited to fluorescein, pyrene, 7-methoxycoumarin, Cascade Blue.TM., Alexa Flur 350, Alexa Flur 
430, Alexa Flur 488, Alexa Flur 532, Alexa Flur 546, Alexa Flur 568, Alexa Flur 594, Alexa Flur 
633, Alexa Flur 647, Alexa Flur 660, Alexa Flur 680, AMCA-X, dialkylaminocoumarin, Pacific 
5 Blue, Marina Blue, BODIPY 493/503, BODIPY Fl-X, DTAF, Oregon Green 500, Dansyl-X, 
6-FAM, Oregon Green 488, Oregon Green 514,Rhodamine Green-X, Rhodol Green, Calcein, 
Eosin, ethidium bromide, NBD, TET, 2\ 4', 5\ T tetrabromosulfonefluorescien, BODIPY-R6G, 
BODIPY-F1 BR2, BODIPY 530/550, HEX, BODIPY 558/568, BODIPY-TMR-X., PyMPO, 
BODIPY 564/570, TAMRA, BODIPY 576/589, Cy3, Rhodamine Red-x, BODIPY 581/591, 

10 carboxyXrhodamine, Texas Red-X, BODIPY-TR-X., Cy5, SpectrumAqua, SpectrumGreen #1, 
SpectrumGreen #2, SpectrumOrange, SpectrumRed, or naphthofluorescein. 

In another embodiment, the "fill in" reaction can be performed with fluorescently labeled 
dNTPs, wherein the nucleotides are labeled with different fluorescent groups. The incorporated 
nucleotides can be detected by any suitable method including but not limited to Fluorescence 

1 5 Resonance Energy Transfer (FRET). 

In another embodiment, a mixture of both labeled ddNTPs and unlabeled dNTPs can be 
used for filling in the recessed 3 ' end of the SNP or locus of interest. Preferably, the 5' overhang 
consists of more than one base, including but not limited to 2, 3, 4, 5, 6 or more than 6 bases. For 
example, if the 5' overhang consists of the sequence "XGAA," wherein X is the locus of interest, 

20 e.g. SNP; then filling in with a mixture of labeled ddNTPs and unlabeled dNTPs will produce 

several different DNA fragments. If a labeled ddNTP is incorporated at position "X," the reaction 
will terminate and a single labeled base will be incorporated. If however, an unlabeled dNTP is 
incorporated, the polymerase continues to incorporate other bases until a labeled ddNTP is 
incorporated. If the first two nucleotides incorporated are dNTPs, and the third is a ddNTP, the 3* 

25 recessed end will be extend by three bases. This DNA fragment can be separated from the other 
DNA fragments that were extended by 1, 2, or 4 bases by size. A mixture of labeled ddNTPs and 
unlabeled dNTPs will allow all bases of the overhang to be filled in, and provides additional 
sequence information about the locus of interest, e.g. SNP (see FIGS. 7E and 9D). 

After incorporation of the labeled nucleotide, the amplified DNA can be digested with a 

30 restriction enzyme that recognizes the sequence provided by the first primer. For example, in FIG 
II, the amplified DNA is digested with a restriction enzyme that binds to region "a," which releases 
the DNA fragment containing the incorporated nucleotide from the streptavidin matrix. 

Alternatively, one primer of each primer pair for each locus of interest can be attached to a 
solid support matrix including but not limited to a well of a microtiter plate. For example, 

35 streptavidin-coated microtiter plates can be used for the amplification reaction with a primer pair, 
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wherein one primer is biotinylated. First, biotinylated primers are bound to the streptavidin-coated 
microtiter plates. Then, the plates are used as the reaction vessel for PGR amplification of the loci 
of interest. After the amplification reaction is complete, the excess primers, salts, and template 
DNA can be removed by washing. The amplified DNA remains attached to the microtiter plate. 
5 The amplified DNA can be digested with a restriction enzyme that recognizes a sequence on the 
second primer and generates a 5' overhang, which contains the locus of interest. The digested 
fragments can be removed by washing. After digestion, the SNP site or locus of interest is exposed 
in the 5' overhang. The recessed 3' end is filled in with a labeled nucleotide, including but not 
limited to, fluorescent ddNTP in the presence of a polymerase. The labeled DNA can be released 

10 into the supernatant in the microtiter plate by digesting with a restriction enzyme that recognizes a 
sequence in the 5' region of the first primer. 

In another embodiment, one nucleotide can be used to determine the sequence of multiple 
alleles of a gene, A nucleotide that terminates the elongation reaction can be used to determine the 
sequence of multiple alleles of a gene. At one allele, the terminating nucleotide is complementary 

15 to the locus of interest in the 5' overhang of said allele. The nucleotide is incorporated and 

terminates the. reaction. At a different allele, the terminating nucleotide is not complementary to the 
locus of interest, which allows a non-terminating nucleotide to be incorporated at the locus of 
interest of the different allele. However, the terminating nucleotide is complementary to a 
nucleotide downstream from the locus of interest in the 5* overhang of said'different allele. The 

20 sequence of the alleles can be determined by analyzing the patterns of incorporation of the 
terminating nucleotide. The terminating nucleotide can be labeled or unlabeled. 

In a another embodiment, the terminating nucleotide is a nucleotide that terminates or 
hinders the elongation reaction including but not limited to a dideoxynucleotide, a 
dideoxynucleotide derivative, a dideoxynucleotide analog, a dideoxynucleotide homolog, a 

25 dideoxynucleotide with a sulfur chemical group, a deoxynucleotide, a deoxynucleotide derivative, 
a deoxynucleotide homolog, a deoxynucleotide analog, a deoxynucleotide with a sulfur chemical 
group, arabinoside triphosphate, a arabinoside triphosphate analog, a arabinoside triphosphate 
homolog, or an arabinoside derivative. 

In another embodiment, a terminating nucleotide labeled with one signal generating moiety 

30 tag, including but not limited to a fluorescent dye, can be used to determine the sequence of the 
alleles of a locus of interest. The use of a single nucleotide labeled with one signal generating 
moiety tag eliminates any difficulties that can arise when using different fluorescent moieties. In 
addition, using one nucleotide labeled with one signal generating moiety tag to determine the 
sequence of alleles of a locus of interest reduces the number of reactions, aijd eliminates pipetting 

35 errors. 
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For example, if the second primer contains the restriction enzyme recognition site for 
BsmFI, digestion will generate a 5' overhang of 4 bases. The second primer can be designed such 
that the locus of interest is located in the first position of the overhang. A representative overhang 
is depicted below, where R represents the locus of interest: 

5 

5' CAC 

3'GTG R T G G 

Overhang position 12 3 4 

1 0 One nucleotide with one signal generating moiety tag can be used to determine whether the 

variable site is homozygous or heterozygous. For example, if the variable site is adenine (A) or 
guanine (G), then either adenine or guanine can be used to determine the sequence of the alleles of 
the locus of interest, provided that there is an adenine or guanine in the overhang at position 2, 3, or 
4. 

15 For example, if the nucleotide in position 2 of the overhang is thymidine, which is 

complementary to adenine, then labeled ddATP, unlabeled dCTP, dGTP, and dTTP can be used to 
determine the sequence of the alleles of the locus of interest. The ddATP can be labeled with any 
signal generating moiety including but not limited to a fluorescent dye. If the template DNA is 
homozygous for adenine, then labeled ddATP* will be incorporated at position 1 complementary to 

20 the overhang at the alleles, and no nucleotide incorporation will be seen at position 2, 3 or 4 
complementary to the overhang. 

Allele 1 5* CCC A* , 

3'GGG T T G G 

25 Overhang position 12 3 4 

Allele 2 5' CCC A* 

3'GGG T T G G 



30 



Overhang position 12 3 4 

One signal will be seen corresponding to incorporation of labeled ddATP at position 1 
complementary to the overhang, which indicates that the individual is homozygous for adenine at 
this position. This method of labeling eliminates any difficulties that may arise from using different 
dyes that have different quantum coefficients. 



35 
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Homozygous guanine: 

If the template DNA is homozygous for guanine, then no ddATP will be incorporated at 
position 1 complementary to the overhang, but ddATP will be incorporated at the first available 
position* which in this case is position 2 complementary to the overhang. For example, if the 
5 second position in the overhang corresponds to a thymidine, then: 

Allele 1 5'CCC G A* 

3 ' GGG C T G G 



10 



Overhang position 1 2 3 4 

Allele 2 5'CCC G A* 

3' GGG C T G G 

Overhang position 12 3 4 



15 One signal will be seen corresponding to incorporation of ddATP at position 2 

complementary to the. overhang, which indicates that the individual is homozygous for guanine. 
The molecules that are filled in at position 2 complementary to the overhang will have a different 
molecular weight than the molecules filled in at position 1 complementary to the overhang. 

20 Heterozygous condition: 



Allele 1 5' CCC A* 

3' GGG T T G G 

Overhang position 12 3 4 

Allele 2 5'CCC G A* 

3' GGG C T G G 

Overhang position 12 3 4 



25 



30 Two signals will be seen; the first signal corresponds to the ddATP filled in at position one 

complementary to the overhang and the second signal corresponds to the ddATP filled in at position 
2 complementary to the overhang. The two signals can be separated based on molecular weight; 
allele 1 and allele 2 will be separated by a single base pair, which allows easy detection and 
quantitation of the signals. Molecules filled in at position one can be distinguished from molecules 

35 filled in at position two using any method that discriminates based on molecular weight including 
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but not limited to gel electrophoresis, capillary gel electrophoresis, DNA sequencing, and mass 
spectrometry. It is not necessary that the nucleotide be labeled with a chemical moiety; the DNA 
molecules corresponding to the different alleles can be separated based on molecular weight. 

If position 2 of the overhang is not complementary to adenine, it is possible that positions 3 
or 4 may be complementary to adenine. For example, position 3 of the overhang may be 
complementary to the nucleotide adenine, in which case labeled ddATP may be used to determine 
the sequence of both alleles. 

Homozygous for adenine: 

Allele 1 5' CCC A* 

3'GGG T G T G 
Overhang position 1 2 3 4 

15 Allele 2 5' CCC A* 

3'GGG T G T G 

Overhang position 12 3 4 



10 



20 



Homozygous for guanine: 

Allele 1 5' CCC G C A* 

3'GGG C G T G 

Overhang position 12 3 4 



25 Allele 2 5' CCC G C A* 

3'GGG C G T G 

Overhang position 12 3 4 



30 



Heterozygous: 

Allele 1 5' CCC A* 

3'GGG T G T G 

Overhang position 12 3 4 

35 Allele 2 5' CCC G C A* 
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3'GGG C G T G 

Overhang position 12 3 4 

Two signals will be seen; the first signal corresponds to the ddATP filled in at position 1 
5 complementary to the overhang and the second signal corresponds to the ddATP filled in at position 
3 complementary to the overhang. The two signals can be separated based on molecular weight; 
allele 1 and allele 2 will be separated by two bases, which can be detected using any method that 
discriminates based on molecular weight. 

Alternatively, if positions 2 and 3 are not complementary to adenine (Le positions 2 and 3 
10 of the overhang correspond to guanine, cytosine, or adenine) but position 4 is complementary to 
adenine, labeled ddATP can be used to determine the sequence of both alleles. 

Homozygous for adenine: 

15 Allele 1 5' CCC A* 

3'GGG T G G T 

Overhang position 12 3 4 

Allele 2 5* CCC A* 

20 3'GGG T G G T 

Overhang position 12 3 4 

One signal will be seen that corresponds to the molecular weight of molecules filled in with 
ddATP at position one complementary to the overhang, which indicates that the individual is 
25 homozygous for adenine at the variable site. 

♦ 

Homozygous for guanine: 





Allele 1 5' CCC 


G 


c 


C 


A* 


30 


3'GGG 


C 


G 


G 


T 




Overhang position 


1 


2 


3 


4 




Allele 2 5' CCC 


G 


C 


C 


A* 




3'GGG 


C 


G 


G 


T 


35 


Overhang position 


1 


2 


3 


4 
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5 



15 



One signal will be seen that coiresponds to the molecular weight of molecules filled in at 
position 4 complementary to the overhang, which indicates that the individual is homozygous for 
guanine. 

Heterozygous: 



Allele 1 5' CCC A* 

3' GGG T G G T 

10 Overhang position 12 3 4 



Allele 2 5* CCC G C C A* 

3' GGG C G G T 



Overhang position 12 3 4 



Two signals will be seen; the first signal corresponds to the ddATP filled in at position one 
complementary to the overhang and the second signal corresponds to the ddATP filled in at position 
4 complementary to the overhang. The two signals can be separated based on molecular weight; 
allele 1 and allele 2 will be separated by three bases, which allows detection and quantitation of the 

20 signals. The molecules filled in at position 1 and those filled in at position 4 can be distinguished 
based on molecular weight 

As discussed above, if the variable site contains either adenine or guanine, either labeled 
adenine or labeled guanine can be used to determine the sequence of both alleles. If positions 2, 3, 
or 4 of the overhang are not complementary to adenine but one of the positions is complementary to 

25 a guanine, then labeled ddGTP can be used to determine whether the template DNA is homozygous 
or heterozygous for adenine or guanine. For example, if position 3 in the overhang corresponds to a 
cytosine then the following signals will be expected if the template DNA is homozygous for 
guanine, homozygous for adenine, or heterozygous: 



3 0 Homozygous for guanine: 

Allele 1 5' CCC G* 

3' GGG C 

Overhang position 1 

35 
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Allele 2 5' CCC G* 

3'GGG C T C T 

Overhang position 12 3 4 

5 One signal will be seen that corresponds to the molecular weight of molecules filled in with 

ddGTP at position one complementary to the overhang, which indicates that the individual is 
homozygous for guanine. 



10 



Homo2ygous for adenine: 

Allele 1 5' CCC A A G* 

3'GGG T T C T 

Overhang position 12 3 4 

15 Allele 2 5' CCC A A G* 

3* GGG T T C T 

Overhang position 12 3 4 

One signal will be seen that corresponds to the molecular weight of molecules filled in at 
20 position 3 complementary to the overhang, which indicates that the individual is homozygous for 
adenine at the variable site. 

Heterozygous: 

25 Allele 1 5' CCC G* 

3'GGG C T C T 

Overhang position 12 3 4 

Allele 2 5' CCC A A G* 

30 3'GGG T T C T 

Overhang position 12 3 4 

Two signals will be seen; the first signal corresponds to the ddGTP filled in at position one 
complementary to the overhang and the second signal corresponds to the ddGTP filled in at position 
35 3 complementary to the overhang. The two signals can be separated based on molecular weight; 
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allele 1 and allele 2 will be separated by two bases, which allows easy detection and quantitation of 
the signals. 

In another embodiment, the nucleotide labeled with a single chemical moiety, which is used 
to determine the sequence of alleles of interest, can be analyzed by a variety of methods including 
5 but not limited to fluorescence detection, DNA sequencing gel, capillary electrophoresis on an 
automated DNA sequencing machine, microchannel electrophoresis, and other methods of 
sequencing, mass spectrometry, time of flight mass spectrometry, quadrupole mass spectrometry, 
magnetic sector mass spectrometry, electric sector mass spectrometry infrared spectrometry, 
ultraviolet spectrometry, palentiostatic amperometry or by DNA hybridization techniques including 

10 Southern Blots, Slot Blots, Dot Blots, and DNA microarrays, wherein DNA fragments would be 
useful as both "probes" and "targets," ELISA, fluorimetry, Fluorescence Resonance Energy 
Transfer (FRET), SNP-IT, GeneChips, HuSNP, BeadArray, TaqMan assay, Invader assay, 
MassExtend, or MassCleave™ (hMC) method. 

Some type IIS restriction enzymes also display alternative cutting as discussed above. For 

15 example, BsmFI will cut at 10/14 and 1 1/15 from the recognition site. However, the cutting 
patterns are not mutually exclusive; if the 1 1/15 cutting pattern is seen at a particular sequence, 
10/14 cutting is also seen. If the restriction enzyme BsmF I cuts at 10/14 from the recognition site, 
the 5' overhang will be X1X2X3X4. If BsmF I cuts 1 1/15 from the recognition site, the 5' overhang 
will be X0X1X2X3. If position Xo of the overhang is complementary to the labeled nucleotide, the 

20 labeled nucleotide will be incorporated at position Xo and provides an additional level of quality 
assurance. It provides additional sequence information. 

For example, if the variable site is adenine or guanine, and position 3 in the overhang is 
complementary to adenine, labeled ddATP can be used to determine the genotype at the variable 
site. If position 0 of the 1 1/15 overhang contains the nucleotide complementary to adenine, ddATP 

25 will be filled in and an additional signal will be seen. 



Heterozygous: 



10/14 Allele 1 
Overhang position 
10/14 Allele 2 
Overhang position 



5' CCA A* 
3'GGT T 
1 

5' CCA G 
3'GGT C 
1 



G T G 

2 3 4 

C A* 

G T G 

2 3 4 
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11/15 Allele 1 5' CC A* 

3'GG T T G T 

Overhang position 0 12 3 



11/15 Allele 2 5' CC A* 

3'GG T C G T 

Overhang position 0 12 3 

1 0 Three signals are seen; one corresponding to the ddATP incorporated at position 0 

complementary to the overhang, one corresponding to the ddATP incorporated at position 1 
complementary to the overhang, and one corresponding to the ddATP incorporated at position 3 
complementary to the overhang. The molecules filled in at position 0, 1, and 3 complementary to 
the overhang differ in molecular weight and can be separated using any technique that discriminates 

15 based on molecular weight including but not limited to gel electrophoresis, and mass spectrometry. 

For quantitating the ratio of one allele to another allele or when determining the relative 
amount of a mutant DNA sequence in the presence of wild type DNA sequence, an accurate and 
highly sensitive method of detection must be used. The alternate cutting displayed by type IIS 
restriction enzymes may increase the difficulty of determining ratios of one allele to another allele 

20 because the restriction enzyme may not display the alternate cutting (1 1/1 5) pattern on the two 
alleles equally. For example, allele 1 may be cut at 10/14 80% of the time, and 1 1/15 20% of the 
time. However, because the two alleles may differ in sequence, allele 2 may be cut at 10/14 90% of 
the time, and 1 1/15 20% of the time. 

For purposes of quantitation, the alternate cutting problem can be eliminated when the 

25 nucleotide at position 0 of the overhang is not complementary to the labeled nucleotide. For 

example, if the variable site corresponds to adenine or guanine, and position 3 of the overhang is 
complementary to adenine (/.<?, a thymidine is located at position 3 of the overhang), labeled ddATP 
can be used to determine the genotype of the variable site. If position 0 of the overhang generated 
by the 1 1/15 cutting properties is not complementary to adenine, (/.e, position 0 of the overhang 

30 corresponds to guanine, cytosine, or adenine) no additional signal will be seen from the fragments 
that were cut 1 1/15 from the recognition site. Position 0 complementary to the overhang can be 
filled in with unlabeled nucleotide, eliminating any complexity seen from the alternate cutting 
pattern of restriction enzymes. This method provides a highly accurate method for quantitating the 
ratio of a variable site including but not limited to a mutation, or a single nucleotide polymorphism. 
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For instance, if SNP X can be adenine or guanine, this method of labeling allows 
quantitation of the alleles that correspond to adenine and the alleles that correspond to guanine, 
without determining if the restriction enzyme displays any differences between the alleles with 
regard to alternate cutting patterns. 

Heterozygous: 



10 



15 



20 
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5' CCG 


A* 
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1 


2 


3 


4 . 
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The overhang general 


ed by the alternate 


cutting properties of BsmF I is depicted below: 
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25 After filling in with labeled ddATP and unlabeled dGTP, dCTP, dTTP, the following 

molecules would be generated: * 

11/15 Allele 1 5>CC G A* 

3'GG C T G T 

30 Overhang position 0 12 3 
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Two signals are seen; one corresponding to the molecules filled in with ddATP at position 
one complementary to the overhang and one corresponding to the molecules filled in with ddATP at 
position 3 complementary to the overhang. Position 0 of the 1 1/15 overhang is filled in with 
unlabeled nucleotide, which eliminates any difficulty in quantitating a ratio for the nucleotide at the 
5 variable site on allele 1 and the nucleotide at the variable site on allele 2. 

Any nucleotide can be used including adenine, adenine derivatives, adenine homologues, 
guanine, guanine derivatives, guanine homologues, cytosine, cytosine derivatives, cytosine 
homologues, thymidine, thymidine derivatives, or thymidine homologues, or any combinations of 
adenine, adenine derivatives, adenine homologues, guanine, guanine derivatives, guanine 
10 homologues, cytosine, cytosine derivatives, cytosine homologues, thymidine, thymidine 
derivatives, or thymidine homologues. 

The nucleotide can be labeled with any chemical group or moiety, including but not limited 
to radioactive molecules, fluorescent molecules, antibodies, antibody fragments, haptens, 
carbohydrates, biotin, derivatives of biotin, phosphorescent moieties, luminescent moieties, 
1 5 electrochemiluminescent moieties, chromatic moieties, and moieties having a detectable electron 
spin resonance, electrical capacitance, dielectric constant or electrical conductivity. The nucleotide 
can be labeled with one or more than one type of chemical group or moiety. 

In another embodiment, labeled and unlabeled nucleotides can be used. Any combination 
of deoxynucleotides and dideoxynucleotides can be used including but not limited to labeled 
20 dideoxynucleotides and labeled deoxynucleotides; labeled dideoxynucleotides and unlabeled 

deoxynucleotides; unlabeled dideoxynucleotides and unlabeled deoxynucleotides; and unlabeled 
dideoxynucleotides and labeled deoxynucleotides. 

In another embodiment, nucleotides labeled with a chemical moiety can be used in the PCR 
reaction. Unlabeled nucleotides then are used to fill-in the 5* overhangs generated after digestion 
25 with the restriction en2yme. An unlabeled terminating nucleotide can be used to in the presence of 
unlabeled nucleotides to determine the sequence of the alleles of a locus of interest. 

For example, if labeled dTTP was used in the PCR reaction, the following 5' overhang 
would be generated after digestion with BsmF I: 



30 10/14 Allele 1 5' CT*G A 

3'GAC T G T G 

Overhang position 12 3 4 

10/14 Allele 2 5'CT*G G C A 

35 3'GAC C G T G 
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Overhang position 12 3 4 

Unlabeled ddATP, unlabeled dCTP, unlabeled dGTP, and unlabeled dTTP can be used to 
fill-in the 5* overhang. Two signals will be generated; one signal corresponds to the DNA 
5 molecules filled in with unlabeled ddATP at position 1 complementary to the overhang and the 
second signal corresponds to DNA molecules filled in with unlabeled ddATP at position 3 
complementaiy to the overhang. The DNA molecules can be separated based on molecular weight 
and can be detected by the fluorescence of the dTTP, which was incorporated during the PCR 
reaction. 

10 The labeled DNA loci of interest sites can be analyzed by a variety of methods including 

but not limited to fluorescence detection, DNA sequencing gel, capillary electrophoresis on an 
automated DNA sequencing machine, microchannel electrophoresis, and other methods of 
sequencing, mass spectrometry, time of flight mass spectrometry, quadrupole mass spectrometry, 
magnetic sector mass spectrometry, electric sector mass spectrometry infrared spectrometry, 

1 5 ultraviolet spectrometry, palentiostatic amperometry or by DNA hybridization techniques including 
Southern Blots, Slot Blots, Dot Blots, and DNA microarrays, wherein DNA fragments would be 
useful as both "probes" and "targets," ELISA, fluorimetry, Fluorescence Resonance Energy 
Transfer (FRET), SNP-IT, GeneChips, HuSNP, BeadArray, TaqMan assay, Invader assay, 
MassExtend, or MassCleave™ (hMC) method. 

20 This method of labeling is extremely sensitive and allows the detection of alleles of a locus 

of interest that are in various ratios including but not limited to 1:1, 1:2, 1:3, 1:4, 1:5, 1:6-1:10, 
1:11-1:20, 1:21-1:30, 1:31-1:40, 1:41-1:50, 1:51-1:60, 1:61-1:70, 1:71-1:80, 1:81-1:90, 1:91:1:100, 
1:101-1:200, 1:250, 1:251-1:300, 1:301-1:400, 1:401-1:500, 1:501-1:600, 1:601-1:700, 1:701- 
1:800, 1:801-1:900, 1:901-1:1000, 1:1001-1:2000, 1:2001-1:3000, 1:3001-1:4000, 1:4001-1:5000, 

25 1:5001-1:6000, 1:6001-1:7000, 1:7001-1:8000, 1:8001-1:9000, 1:9001-1:10,000; 1:10,001- 
1:20,000, 1:20,001:1:30,000, 1:30,001-1:40,000, 1:40,001-1:50,000, and greater than 1:50,000. 

For example, this method of labeling allows one nucleotide labeled with one signal 
generating moiety to be used to determine the sequence of alleles at a SNP locus, or detect a 
mutant allele amongst a population of normal alleles, or detect an allele encoding antibiotic 

30 resistance from a bacterial cell amongst alleles from antibiotic sensitive bacteria, or detect an allele 
from a drug resistant virus amongst alleles from drug-sensitive virus, or detect an allele from a non- 
pathogenic bacterial strain amongst alleles from a pathogenic bacterial strain. 

As shown above, a single nucleotide can be used to determine the sequence of the alleles at 
a particular locus of interest. This method is especially useful for determining if an individual is 

35 homozygous or heterozygous for a particular mutation or to determine the sequence of the alleles at 
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a particular SNP site. This method of labeling eliminates any errors caused by the quantum 
coefficients of various dyes. It also allows the reaction to proceed in a single reaction vessel 
including but not limited to a well of a microtiter plate, or a single eppendorf tube. 

This method of labeling is especially useful for the detection of multiple genetic signals in 
5 the same sample. For example, this method is useful for the detection of fetal DNA in the blood, 
serum, or plasma of a pregnant female, which contains both maternal DNA and fetal DNA. The 
maternal DNA and fetal DNA may be present in the blood, serum or plasma at ratios such as 97:3; 
however, the above-described method can be used to detect the fetal DNA. This method of labeling 
can be used to detect two, three, four or more than four different genetic signals in the sample 
10 population 

This method of labeling is especially useful for the detection of a mutant allele that is 
among a large population of wild type alleles. Furthermore, this method of labeling allows the 
detection of a single mutant cell in a large population of wild type cells. For example, this method 
of labeling can be used to detect a single cancerous cell among a large population of normal cells. 
15 Typically, cancerous cells have mutations in the DNA sequence. The mutant DNA sequence can be 
identified even if there is a large background of wild type DNA sequence. This method of labeling 
can be used to screen, detect, or diagnosis any type of cancer including but not limited to colon, 
renal, breast, bladder, liver, kidney, brain, lung, prostate, and cancers of the blood including 
leukemia. 

20 This labeling method can also be used to detect pathogenic organisms, including but not 

limited to bacteria, fungi, viruses, protozoa, and mycobacteria. It can also be used to discriminate 
between pathogenic strains of microorganism and non-pathogenic strains of microorganisms 
including but not limited to bacteria, fungi, viruses, protozoa, and mycobacteria. 

For example, there are several strains of Escherichia coli (£ coli), and most are non- 
25 pathogenic. However, several strains, such as £ coli 0157 are pathogenic. There are genetic 
differences between non-pathogenic £ coli strains and pathogenic £ coli. The above described 
method of labeling can be used to detect pathogenic microorganisms in a large population of non- 
pathogenic organisms, which are sometimes associated with the normal flora of an individual. 

30 VI. Analysis of the locus of interest 

The loci of interest can be analyzed by a variety of methods including but not limited to 
fluorescence detection, DNA sequencing gel, capillary electrophoresis on an automated DNA 
sequencing machine, (e.g. the ABI Prism 3100 Genetic Analyzer or the ABI Prism 3700 Genetic 
Analyzer), microchannel electrophoresis, and other methods of sequencing, Sanger dideoxy 

35 sequencing, mass spectrometry, time of flight mass spectrometry, quadrupole mass spectrometry, 
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magnetic sector mass spectrometry, electric sector mass spectrometry infrared spectrometry, 
ultraviolet spectrometry, palentiostatic amperometty or by DNA hybridization techniques including 
Southern Blot, Slot Blot, Dot Blot, and DNA microarray, wherein DNA fragments would be useful 
as both "probes" and 'targets," ELISA, fluorimetry, fluorescence polarization, Fluorescence 
5 , Resonance Energy Transfer (FRET), SNP-IT, GeneChips, HuSNP, BeadArray, TaqMan assay, 
Invader assay, MassExtend, or MassCleave™ (hMC) method. 

The loci of interest can be analyzed using gel electrophoresis followed by fluorescence 
detection of the incorporated nucleotide. Another method to analyze or read the loci of interest is to 
use a fluorescent plate reader or fluorimeter directly on the 96-weIl streptavidin coated plates. The 
1 0 plate can be placed onto a fluorescent plate reader or scanner such as the Pharmacia 9200 Typhoon 
to read each locus of interest. 

Alternatively, the PCR products of the loci of interest can be pooled and after "filling in" 
(FIG. 1 0), the products can be separated by size, using any method appropriate for the same, and 
then analyzed using a variety of techniques including but not limited to fluorescence detection, 

1 5 DNA sequencing gel, capillary electrophoresis on an automated DNA sequencing machine, 
microchannel electrophoresis, other methods of sequencing, Sanger dideoxy sequencing, DNA 
hybridization techniques including Southern Blot, Slot Blot, Dot Blot, and DNA microarray, mass 
spectrometry, time of flight mass spectrometry, quadrupole mass spectrometry, magnetic sector 
mass spectrometry, electric sector mass spectrometry infrared spectrometry, ultraviolet 

20 spectrometry, palentiostatic amperometry. For example, polyaciylamide gel electrophoresis can be 
used to separate DNA by size and the gel can be scanned to determine the color of fluorescence in 
each band (using e.g., ABI 377 DNA sequencing machine or a Pharmacia Typhoon 9200). 

In another embodiment, the sequence of the locus of interest can be determined by 
detecting the incorporation of a nucleotide that is 3' to the locus of interest, wherein said nucleotide 

25 is a different nucleotide from the possible nucleotides at the locus of interest. This embodiment is 
especially useful for the sequencing and detection of SNPs. The efficiency and rate at which DNA 
polymerases incorporate nucleotides varies for each nucleotide. 

According to the data from the Human Genome Project, 99% of all SNPs are binaiy. The 
sequence of the human genome can be used to determine a nucleotide that is 3* to the SNP of 

30 interest. When a nucleotide that is 3 ' to the SNP site differs from the possible nucleotides at the 
SNP site, a nucleotide that is one or more than one base 3' to the SNP can be used to determine the 
sequence of the SNP site. 

For example, suppose the sequence of SNP X on chromosome 13 is to be determined. The 
sequence of the human genome indicates that SNP X can either be adenosine or guanine and that a 

35 nucleotide 3' to the locus of interest is a thymidine. A primer that contains a restriction enzyme 

79 



t 



WO 2004/078994 PCT/US2004/006337 

recognition site for BsmF I, which is designed to be 13 bases from the locus of interest after 
amplification, is used to amplify a DNA fragment containing SNP X. Digestion with the restriction 
enzyme BsmF I generates a 5' overhang that contains the locus of interest, which can either be 
adenosine or guanine. The digestion products can be split into two "fill in" reactions: one contains 
5 dTTP, and the other reaction contains dCTP. If the locus of interest is homozygous for guanine, 
only the DNA molecules that were mixed with dCTP will be filled in. If the locus of interest is 
homozygous for adenosine, only the DNA molecules that were mixed with dTTP will be filled in. 
If the locus of interest is heterozygous, the DNA molecules that were mixed with dCTP will be 
filled in as well as the DNA molecules that were mixed with dTTP. After washing to remove the 

10 excess dNTP, the samples are filled in with labeled ddATP, which is complimentary to the 

nucleotide (thymidine) that is 3' to the locus of interest. The DNA molecules that were filled in by 
the previous reaction will be filled in with labeled ddATP. If the individual is homozygous for 
adenosine, the DNA molecules that were mixed with dTTP subsequently will be filled in with the 
labeled ddATP. However, the DNA molecules that were mixed with dCTP, would not have 

15 incorporated that nucleotide, and therefore, could not incorporate the ddATP. Detection of labeled 
ddATP only in the molecules that were mixed with dTTP indicates that the nucleotide at SNP X on 
chromosome 13 is adenosine. 

In another embodiment, large scale screening for the presence or absence of single 
nucleotide polymorphisms or mutations can be performed. One to tens to hundreds to thousands of 

20 loci of interest on a single chromosome or on multiple chromosomes can be amplified with primers 
as described above in the "Primer Design" section. The primers can be designed so that each 
amplified loci of interest is of a different size (FIG. 2). The multiple loci of interest can be of a 
DNA sample from one individual representing multiple loci of interest on a single chromosome, 
multiple chromosomes, multiple genes, a single gene, or any combination thereof. 

25 When human data is being analyzed, the known sequence can be a specific sequence that 

has been determined from one individual (including e.g. the individual whose DNA is currently 
being analyzed), or it can be a consensus sequence such as that published as part of the human 
genome. 

Ratio of Alleles at Heterozygous Locus of Interest 
30 In one embodiment, the ratio of alleles at a heterozygous locus of interest can be calculated. 

The intensity of a nucleotide at the loci of interest can be quantified using any number of computer 
programs including but not limited to GeneScan and ImageQuant. For example, for a heterozygous 
SNP, there are two nucleotides , and each should be present in a 1 : 1 ratio. In a preferred 
embodiment, the ratio of multiple heterozygous SNPs can be calculated. 
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In one embodiment, the ratio for a variable nucleotide at alleles at a heterozygous locus of 
interest can be calculated. The intensity of each variable nucleotide present at the loci of interest 
can be quantified using any number of computer programs including but not limited to GeneScan 
and ImageQuant. For example, for a heterozygous SNP, there will be two nucleotides present, and 
5 each may be present in a 1 : 1 ratio. In a preferred embodiment, the ratio of multiple heterozygous 
SNPs can be calculated. 

In another embodiment, the ratio of alleles at a heterozygous locus of interest on a 
chromosome is summed and compared to the ratio of alleles at a heterozygous locus of interest on a 
different chromosome. In a preferred embodiment, ,the ratio of alleles at multiple heterozygous loci 

10 of interest on a chromosome is summed and compared to the ratio of alleles at multiple 

heterozygous loci of interest on a different chromosome. The ratio obtained from SNP 1, SNP 2, 
SNP 3, SNP 4, etc on chromosome 1 can be summed. This ratio can then be compared to the ratio 
obtained from SNP A, SNP B, SNP C, SNP D, etc. 

For example, 100 SNPs can be analyzed on chromosome 1. Of these 100 SNPs, assume 50 

15 are heterozygous. The ratio of the alleles at heterozygous SNPs on chromosome 1 can be summed, 
and should give a ratio of approximately 50:50. Likewise, of 100 SNPs analyzed on chromosome 
21, assume 50 are heterozygous. The ratio of alleles at heterozygous SNPs on chromosome 21 is 
summed. With a normal number of chromosomes, the ratio should be approximately 50:50, and 
thus there should be no difference between the ratio obtained from chromosome 1 and 21. 

20 However, if there is an additional copy of chromosome 21, an additional allele will be provided, 
and the ratio should be approximately 66:33. Thus, the ratio for nucleotides at heterozygous SNPs 
can be used to detect the presence or absence of chromosomal abnormalities. Any chromosomal 
abnormality can be detected including aneuploidy, polyploidy, inversion, a trisomy, a monosomy, 
duplication, deletion, deletion of a part of a chromosome, addition, addition of a part of 

25 chromosome, insertion, a fragment of a chromosome, a region of a chromosome, chromosomal 
rearrangement, and translocation. The method is especially useful for the detection of trisomy 13, 
trisomy 18, trisomy 21, XXY, and XYY. 

The present invention provides a method to quantitate a ratio for the alleles at a 
heterozygous locus of interest. The loci of interest include but are not limited to single nucleotide 

30 polymorphisms, mutations. There is no need to amplify the entire sequence of a gene or to 
quantitate the amount of a particular gene product. The present invention does not rely on 
quantitative PCR. 

Detection of Fetal Chromosomal Abnormalities 

As discussed above in the section entitled "DNA template," the template DNA can be 
35 obtained from a sample of a pregnant female, wherein the template DNA comprises maternal 
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template DNA and fetal template DNA. In one embodiment, the template DNA is obtained from 
, the blood of a pregnant female. In a preferred embodiment, the template DNA is obtained from the 
plasma or serum from the blood of a pregnant female. 

In one embodiment, the template DNA from the sample from the pregnant female 
5 comprises both maternal template DNA and fetal template DNA. In another embodiment, maternal 
template DNA is obtained from any nucleic acid containing source including but not limited to cell, 
tissue, blood, serum, plasma, saliva, urine, tears, vaginal secretion, lymph fluid, cerebrospinal fluid, 
mucosa secretion, peritoneal fluid, ascitic fluid, fecal matter, or body exudates, and sequenced to 
identify homo2ygous or heterozygous loci of interest, which are the loci of interest analyzed on the 
10 template DNA obtained from the sample from the pregnant female. 

In a preferred embodiment, the sequence of the alleles of multiple loci of interest on 
maternal template DNA is determined to identify homozygous loci of interest. In another 
embodiment, the sequence of the alleles of multiple loci of interest on maternal template DNA is 
determined to identify heterozygous loci of interest The sequence of the alleles of multiple loci of 
15 interest on maternal template DNA can be determined in a single reaction or in multiple reactions. 

For example, if 100 maternal loci of interest on chromosome 21 and 100 maternal loci of 
interest on chromosome 1 are analyzed, one would predict approximately 50 loci of interest on each 
chromosome to be homozygous and 50 to be heterozygous. The 50 homozygous loci of interest, or 
the 50 heterozygous loci of interest or the 50 homozygous and 50 heterozygous loci of interest, or 
20 any combination of the homozygous and heterozygous loci of interest on each chromosome can be 
analyzed using the template DNA from the sample from the pregnant female. 

The locus of interest on the template DNA from the sample of the pregnant female is 
analyzed using the amplification, isolation, digestion, fill in, and detection methods described 
above. The same primers used to analyze the locus of interest on the maternal template DNA are 
25 used to screen the template DNA from the sample from the pregnant female. Any number of loci of 
interest can be analyzed on the template DNA from the sample from the pregnant female. For 
example, 1, 1-5, 5-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-150, 
150-200, 200-250, 250-300, 300-500, 500-1000, 1000-2000, 2000-3000, 3000-4000 or more than 
4000 homozygous maternal loci of interest can be analyzed in the template DNA from the sample 
30 from the pregnant female. In a preferred embodiment, multiple loci of interest on multiple 
chromosomes are analyzed 

From the population of homozygous maternal loci of interest, there will be both 
heterozygous and homozygous loci of interest from the template DNA from the sample from the 
pregnant female; the heterozygous loci of interest can be further analyzed. At heterozygous loci of 
35 interest, the ratio of alleles can be used to determine the number of chromosomes that are present. 
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The percentage of fetal DNA present in the sample from the pregnant female can be 
calculated by determining the ratio of alleles at a heterozygous locus of interest on a chromosome 
that is not typically associated with a chromosomal abnormality. In a preferred embodiment, the 
ratio of alleles at multiple heterozygous loci of interest on a chromosome can be used to determine 
5 the percentage of fetal DNA. For example, chromosome 1, which is the largest chromosome in the 
human genome, can be used to determine the percentage of fetal DNA. 

For example, suppose SNP X is homozygous.at the maternal template DNA (A/A). At 
SNP X, the template DNA from the sample from the pregnant female, which can contain both fetal 
DNA and maternal DNA, is heterozygous (A/G). The nucleotide guanine represents the fetal DNA 
10 because at SNP X the mother is homozygous, and thus the guanine is attributed to the fetal DNA. 
The guanine at SNP X can be used to calculate the percentage of fetal DNA in the sample. 

Alternatively, multiple loci of interest on two or more chromosomes can be examined to 
determine the percentage of fetal DNA. For example, multiple loci of interest can be examined on 
chromosomes 13, and 18 to determine the percentage of fetal DNA because organisms with 
15 chromosomal abnormalities at chromosome 13 and 18 are not viable. 

Alternatively, for a male fetus, a marker on the Y chromosome can be used to determine the 
amount of fetal DNA present in the sample. A panel of serial dilutions can be made using the 
template DNA isolated from the sample from the pregnant female, and quantitative PGR analysis 
perfonned. Two PGR reactions can be performed: one PCR reaction to amplify a marker on the Y 
20 chromosome, for example SRY, and the other reaction to amplify a region on any of the autosomal 
chromosomes. The amount of fetal DNA can be calculated using the following formula: 

Percent Fetal DNA: (last dilution Y chromosome detected / last dilution autosomal 
chromosome detected) *2 * 100. 

If at SNP A, the mother is homozygous A/A, and the fetus is heterozygous A/G, then the 
. 25 ratio of A:G can be used to detect chromosomal abnormalities. If the fetal DNA is fifty percent 
(50%) of the DNA in the maternal blood, then at SNP A where the maternal nucleotide is an 
adenine and the other nucleotide is a guanine, one would expect the ratio of adenine (two adenines 
from the maternal template DNA and one from the fetal template DNA) to guanine (from the fetal 
template DNA) to be 25:75 or 0.33. However, if the fetus has a trisomy of this particular 
30 chromosome, and the additional chromosome is contributed by the mother, and thus an additional 
adenine nucleotide is present, then one would expect the ratio of 0.25 (50 (G)/ (2*50 maternal A + 
2*50 fetal A). Thus, there is a difference of 8% between the ratio obtained from a chromosome 
present in two copies, and a chromosome present in a trisomy condition. On the other hand, if the 
additional chromosome is contributed by the father, and thus, an additional guanine is present, then 
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one would expect the ratio of 0.66 (2*50 for G fetal allele / (2*50 maternal A allele + 50 for fetal A 
allele). 

However, if the fetal DNA is 40% of the DNA in the maternal blood, the expected ratio 
without a trisomy is 0.25 (40 for fetal G allele/ 2*60 for maternal A allele + 1*60 for fetal A allele). 
5 If the fetus has a trisomy, and the additional chromosome is provided by the mother, the expected 
ratio would be 0.20 (40 for fetal G allele / (2*60 for maternal A allele + 2* 40 for fetal A allele). A 
5% difference between the ratios obtained from a chromosome present in two copies and a • 
chromosome present in the Trisomy condition is detected. 

In another embodiment, multiple loci of interest on multiple chromosomes can be 
1 0 examined. The ratios for the alleles at each heterozygous locus of interest on a chromosome can be 
summed and compared to the ratios for the alleles at each locus of interest on a different 
chromosome. The chromosomes that are compared can be of human origin, and include but are not 
limited to chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, 
and Y. The ratio obtained from multiple chromosomes can be compared to the ratio obtained for a 
15 single chromosome or from multiple chromosomes. 

In one embodiment, one of the chromosomes used in the comparison can be chromosome 
13, 15, 16; 18, 21, 22, X or Y. In a preferred embodiment, the ratios on chromosomes 13, 1 8, and 
21 are compared. 

For example, assuming 40% fetal DNA in the sample from the pregnant female, the ratio of 

20 the alleles at a heterozygous locus of interest on chromosome 1 will be 0.25 (40 for fetal G allele/ 
(2*60 for maternal A allele + 40 for fetal A allele). Likewise, the ratio of alleles at a heterozygous 
locus of interest on chromosome 21 will be present in a ratio of 0.25. However, in a fetus with 
trisomy 21 where the additional chromosome is contributed by the mother, the nucleotides at a 
heterozygous locus of interest on chromosome 2 1 will be present in a ratio of 0.20 (40 for fetal G 

25 al!ele/(60*2 for maternal A allele + 40* 2 for fetal A allele). By contrast, the ratio for chromosome 
1 will remain at 0.25, and thus the 5% difference in ratios will signify an additional chromosome. 
One to tens to hundreds to thousands of loci of interest can be analyzed. 

In another embodiment, the loci of interest on the template DNA from the sample from the 
pregnant female can be genotyped without prior identification of the homozygous maternal loci of 

30 interest. It is not necessary to genotype the maternal template DNA prior to analysis of the template 
DNA containing both maternal and fetal template DNA. 

The ratio of the alleles at the loci of interest can be used to determine the presence or 
absence of a chromosomal abnormality. The template DNA from the sample from the pregnant 
female contains both maternal template DNA and fetal template DNA. There are 3 possibilities at 

35 each SNP for either the maternal template DNA or the fetal template DNA: heterozygous, 
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homozygous for allele 1, or homozygous for allele 2. The possible nucleotide ratios for a SNP that 
is either an adenine or a guanine are shown in Table II. The ratios presented in Table II are 
calculated with the fetal DNA at 50% of the DNA in the sample from the pregnant female. 



Table II. Ratios for nucleotides for a heterozygous SNP. 

Fetal SNP 



Maternal SNP 


A/A 


G/G 


A/G 


A/A 


100% A 


N/A 


75%A,25%G 


G/G 


N/A 


100% G 


25%A,75%G 


A/G 


75%A,25%G 


25% A, 75% G 


50% A, 50% G 



5 



There are three nucleotide ratios: 100% of a single nucleotide, 50:50, or 75:25. These 
ratios will vary depending on the amount of fetal DNA present in sample from the pregnant female. 
However, the percentage of fetal DNA should be constant regardless of the chromosome analyzed. 
Therefore, if chromosomes are present in two copies, the above calculated ratios will be seen. 
10 On the other hand, these percentages will vary when an additional chromosome is present 

For example, assume that SNP X can be adenine or guanine, and that the percentage of fetal DNA 
in the sample from the pregnant female is 50%. Analysis of the loci of interest on chromosome 1 
will provide the ratios discussed above: 100:0, 50:50, and 75:25. The possible ratios for a SNP that 
is A/G with an additional chromosome are provided in Table m. 

15 Table ID: Nucleotides ratios at a SNP when an additional 

copy of a chromosome is present 



Fetal SNP 



Maternal 
SNPX 


A/A/A 


G/G/G 


A/G/G 


A/A/G 


A/A 


100% A 


N/A 


60% A, 40%G 


80%A,20%G 


G/G 


N/A 


100% G 


20% A, 80% G 


40% A, 60% G 


A/G 


80% A, 20% G 


20% A 80% G 


40% A, 60% G 


60% A, 40% G 



The possible ratios for the alleles at a heterozygous SNP with an additional copy of a 
20 chromosome are: 0:100, 40:60, and 20:80. Two of these ratios, 40:60, and 20:80 differ from the 
ratios of alleles at heterozygous SNPs obtained with two copies of a chromosome. As discussed 
above, the ratios for the nucleotides at a heterozygous SNP depend on the amount of fetal DNA 
present in the sample. However, the ratios, whatever they are, will remain constant across 
chromosomes unless there is a chromosomal abnormality. 
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The ratio of alleles at heterozygous loci of interest on a chromosome can be compared to 
the ratio for alleles at heterozygous loci of interest on a different chromosome. For example, the 
ratio for multiple loci of interest on chromosome 1 (the ratio at SNP 1, SNP 2, SNP 3, SNP 4, etc.) 
can be compared to the ratio for multiple loci of interest on chromosome 21 (the ratio at SNP A, 
5 SNP B, SNP C, SNP D, etc.). Any chromosome can be compared to any other chromosome. There 
is no limit to the number of chromosomes that can be compared. 

Referring back to the data in Tables II and HI, the ratios for nucleotides at a heterozygous 
SNP on chromosome 1, which was present in two copies, were 25:75, and 50:50. On the other, the 
ratio for nucleotides at a heterozygous SNP on chromosome 21, which was present in three copies, 

10 were 40:60, and 20:80. The difference between these two ratios indicates a chromosomal 

abnormality. The ratios can be pre-calculated for the full range of varying degrees of fetal DNA 
present in the maternal serum. Tables II and m demonstrate that both maternal homozygous and 
heterozygous loci of interest can be used to detect the presence of a fetal cliromosomal abnormality. 
The above example illustrates how the ratios for nucleotides at heterozygous SNPs can be 

1 5 used to detect the presence of an additional chromosome. The same type of analysis can be used to 
detect chromosomal rearrangements, translocations, mini-chromosomes, duplications of regions of 
chromosomes, monosomies, deletions of regions of chromosomes, and fragments of chromosomes. 
The method does not require genotyping of the mother or the father, however, it may be done to 
reduce the number of SNPs that need to be analyzed with the plasma sample. 

20 The present invention does not quantitate the amount of a fetal gene product, nor is the 

utility of the present invention limited to the analysis of genes found on the Y chromosome. The 
present invention does not merely rely on the detection of a paternally inherited nucleic acid, rather, 
the present invention provides a method that allows the ratio of maternal to fetal alleles at loci of 
interest, including SNPs, to be calculated. 

25 In another embodiment, a single allele at a locus of interest can be used to determine the 

presence or absence of a chromosomal abnormality and detect a genetic disorder in the fetus. In a 
preferred embodiment, the maternal allele at a locus of interest is used to determine the presence or 
absence of a chromosomal abnormality in the fetus. The biological mother can be genotyped to 
identify a homozygous locus of interest. Likewise, the biological father can be genotyped to 

30 identify a homozygous locus of interest. The locus of interest wherein the maternal template DNA 
is homozygous for one allele and the paternal template DNA is homozygous for the other allele is 
analyzed using the template DNA obtained from the plasma of the mother, which contains both 
maternal and fetal template DNA. Any number of loci of interest can be analyzed including but not 
limited to 1, 1-5, 5-10, 10-20, 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-150, 
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150-200, 200-250, 250-300, 300-500, 500-1000, 1000-2000, 2000-3000, 3000-4000, 4000-8000, 
8000-16000, 16000-32000 or greater than 32000 loci of interest. 

In a preferred embodiment, the signal from the maternal genome and the fetal allele, which 
was inherited from the mother, at the locus of interest is quantitated. For example, if the 5' 
overhang, which is generated after digestion with the type US enzyme, is filled in with a nucleotide 
that is fluorescently labeled, the intensity of the incorporated dye can be quantitated 

Maternal Template DNA - - Homozygous for Adenine 
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Template DNA in the plasma - - both maternal template DNA and fetal template DNA 
Maternal Template DNA - - Homozygous for Adenine 
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Allele 2 5' CCG A* 

3'GGC T G T G 

Overhang position 12 3 4 



Fetal Template DNA - -Heterozygous 

10 Allele! 5' CCG A* 

3'GGC T G T G 

Overhang position 12 3 4 

Allele 2 5' CCG ddC 

15 3'GGC T G T G 

Overhang position 12 3 4 

The template DNA obtained from the plasma of the pregnant female is filled in with labeled 
ddATP, and unlabeled ddCTP (depicted as ddC above), ddGTP, and ddTTP. The plasma DNA 

20 contains two maternal adenine alleles, and one fetal adenine allele. By filling in with labeled 

ddATP and unlabeled ddCTP, only the maternal allele and the fetal allele inherited from the mother 
are detected. The paternal allele is not detected in this manner. The fill-in reactions can be 
performed as described in the Examples below. 

A single locus of interest can be analyzed or multiple loci of interest. The intensity of the 

25 maternal allele at multiple loci of interest can be quantitated. An average can be calculated for a 
chromosome and compared to the average obtained for a different chromosome. For example, the 
average intensity of the maternal allele and the fetal allele inherited from the mother at chromosome 
1 can be compared to the average intensity of the maternal allele and the fetal allele inherited from 
the mother at chromosomes 13, 18, or 21. In a preferred embodiment, chromosomes 13, 15, 18, 21, 

30 22, X and Y, when applicable, are compared. 

The signal from a locus of interest may be stronger than another locus of interest. 
However, there is no reason why the signal from the locus of interest on one chromosome would be 
stronger than the signal from the locus of interest on another chromosome. While the signal from 
various loci of interest may be variable, the variation should be seen across the genome. The 

35 average signal of the loci of interest should be the same when any chromosomes are compared. 
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The conditions of the PCR reaction can be optimized so that an equivalent amount of PCR 
product is produced. For example, the concentration of the primers, the concentration of 
nucleotides, and the number of cycles for each loci of interest can be optimized. In addition, the 
fill-in reactions can be done under conditions such that any increase in a specific allele can be 
5 detected. The fill-in reaction conditions can be optimized to detect any increase in the allele of 
interest including but not limited to the concentration of reagents, the time of the fill-in reaction, 
and the temperature of the reaction. 

With a normal genetic karyotype, the signal at each locus of interest comprise signal from 
the maternal genome, and signal from the fetal allele, which was inherited from the mother. The 

10 percent of fetal DNA in the sample remains constant, regardless of the chromosome that is 

analyzed. For example, if at SNP X, the maternal genome is A/A, and the paternal genome is G/G, 
then the fetal genome will be A/G, and the fetal adenine allele will comprise a specified percentage 
of the signal from the adenine allele. If the percentage of fetal DNA is 20% in the maternal plasma, 
then the fetal adenine allele will contribute 20% of the signal for the adenine allele. The 

15 contribution of the fetal allele, which was inherited from the mother, will be constant for any locus 
of interest that is analyzed. 

When there is a chromosomal abnormality, the signal from the maternal genome and the 
fetal allele, which was inherited from the mother, at the loci of interest will differ from the signal 
observed for other chromosomes. For example, with a Trisomy, the signal at the locus of interest 

20 will comprise the maternal genome and two fetal alleles, which were inherited from the mother. 
The signal from the loci of interest for the chromosome that is present in three copies will have the 
contribution of an additional fetal allele, which will alter the signal of the alleles at these loci of 
interest. 

In another embodiment, a ratio can be calculated using a single allele and a standard DNA 
25 of known quantity. In a preferred embodiment, a ratio is calculated using the alleles of the maternal 
genome, and the fetal allele, which was inherited from the mother, and a standard DNA. The 
biological mother can be genotyped to identify a homozygous locus of interest. Likewise, the 
biological father can be genotyped to identify a homozygous locus of interest. The locus of interest 
wherein the maternal template DNA is homozygous for one allele and the paternal template DNA is 
30 homozygous for the other allele is analyzed using the template DNA obtained from the plasma of 
the mother, which contains both maternal and fetal template DNA. 

In a preferred embodiment, the signal from the maternal genome and the fetal allele, which 
was inherited from the mother, at the locus of interest is quantitated. For example, if the 5* 
overhang, which is generated after digestion with the type IIS enzyme, is filled in with a nucleotide 
35 that is fluorescently labeled, the intensity of the incorporated dye can be quantitated. 
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Template DNA in the plasma - - both maternal template DNA and fetal template DNA 
Maternal Template DNA - - Homozygous for Adenine 



Allele 1 5' CCG A* 

3'GGC T G T G 
Overhang position 12 3 4 



Allele 2 5' CCG A* 

3'GGC T G T Q 

Overhang position 12 3 4 



Fetal Template DNA - -Heterozygous 

Allele 1 5' CCG A* 

3'GGC T G T G 

Overhang position 12 3 4 



Allele 2 5' CCG ddC 

3'GGC T G T G 

Overhang position 12 3 4 



The template DNA obtained from the plasma of the pregnant female is filled in with labeled 
ddATP, and unlabeled ddCTP (depicted as ddC above), ddGTP, and ddTTP. The plasma DNA 
contains two maternal adenine alleles, and one fetal adenine allele. By filling in with labeled 
ddATP and unlabeled ddCTP, only the maternal allele and the fetal allele inherited from the mother 
are detected. 

A single locus of interest or multiple loci of interest can be analyzed. For each locus of 
interest, a DNA molecule is designed to migrate at about the same position as the locus of interest. 
In a preferred embodiment, the DNA molecule is of known quantity. A ratio is calculated using the 
alleles of the maternal genome and the fetal allele, which was inherited from the mother, and the 
DNA molecule designed to migrate at about the same position as the locus of interest. For 
example, if the locus of interest is designed to migrate at 30 base pairs, the DNA molecule can be 
designed to migrate at about 30 base pairs including but not limited to 20-25, 25-30, 30-35, 35-45, 
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and greater than 45. The alleles of the maternal genome and the fetal allele, which was inherited 
from the mother, and the standard DNA molecule can be analyzed in the same reaction or can be 
analyzed in a separate reaction. The alleles of the maternal genome and the fetal allele, which was 
inherited from the mother, and the standard DNA molecule can be analyzed in the same lane of a 
5 gel or can be analyzed in separate lanes of a gel The use of standard DNA molecules of known 
quantity, which are designed to migrate at the same position as the loci of interest, will coiTect for 
various factors including but not limited to the intensity of the bands relative to the location on the 
gel. 

The ratio of multiple loci of interest on a chromosome can be quantitated, and an average 
10 calculated. The average can be compared to the average obtained for another chromosome. The 
ratio is used to indicate the presence or absence of a chromosomal abnormality. Analysis of the 
alleles of the maternal genome and the fetal allele also allows detection of single gene or multi-gene 
genetic disorders. 

Any chromosome of any organism can be analyzed using the methods of the invention. For 

15 example, in humans, chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, X or Y can be analyzed using the methods of the invention. The ratio for the alleles at a 
heterozygous locus of interest on any chromosome can be compared to the ratio for the alleles at a 
heterozygous locus of interest on any other chromosome. 

Thus, the present invention provides a non-invasive technique, whi9h is independent of 

20 fetal cell isolation, for rapid, accurate and definitive detection of chromosome abnormalities in a 
fetus. The present invention also provides a non-invasive method for determining the sequence of 
DNA from a fetus. The present invention can be used to detect any alternation in gene sequence as 
compared to the wild type sequence including but not limited to point mutation, reading frame shift, 
transition, transversion, addition, insertion, deletion, addition-deletion, frame-shift, missense, 

25 reverse mutation, and microsatellite alteration. 

Detection of Fetal Chromosomal Abnormalities Using Short Tandem Repeats 

Short tandem repeats (STRs) are short sequences of DNA, normally of 2-5 base pairs in 
length, which are repeated numerous times in a head-tail manner. Tandemly repeated DNA 
sequences are widespread throughout the human genome, and show sufficient variability among the 

30 individuals in a population. Minisatellites have core repeats with 9-80 base pairs. 

In another embodiment, short tandem repeats can be used to detect fetal chromosomal 
abnormalities. Template DNA can be obtained from a nucleic acid containing sample including but 
not limited to cell, tissue, blood, serum, plasma, saliva, urine, tears, vaginal secretion, lymph fluid, 
cerebrospinal fluid, mucosa secretion, peritoneal fluid, ascitic fluid, fecal matter, or body exudates. 

35 In another embodiment, a cell lysis inhibitor is added to the nucleic acid cdntaining sample. In a 
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preferred embodiment, the template DNA is obtained from the blood of a pregnant female. In 
another embodiment, the template DNA is obtained from the plasma or serum from the blood of a 
pregnant female. 

The template DNA obtained from the blood of the pregnant female will contain both fetal 
5 DNA and maternal DNA. The fetal DNA comprises STRs from the mother and the father. The 
variation in the STRs between the mother and father can be used to detect chromosomal 
abnormalities. 

Primers can be designed to amplify short tandem repeats. Any method of amplification can 
be used including but not limited to polymerase chain reaction, self-sustained sequence reaction, 
1 0 ligase chain reaction, rapid amplification of cDNA ends, polymerase chain reaction and ligase chain 
reaction, Q-beta phage amplification, strand displacement amplification, and splice overlap 
extension polymerase chain reaction. In a preferred embodiment, PCR is used. 

Any number of short tandem repeats can be analyzed including but not limited to 1-5, 5-10, 
10-50, 50-100, 100-200, 200-300, 300-400, 400-500, 500-1000, and greater than 1000. The short 
1 5 tandem repeats can be analyzed in a single PCR reaction or in multiple PCR reactions. In a 
preferred embodiment, STRs from multiple chromosomes are analyzed. 

After amplification, the PCR products can be analyzed by any number of methods 
including but not restricted to gel electrophoresis, and mass spectrometry. The template DNA from 
the pregnant female comprises STRs of maternal and paternal origin. The STRs of paternal origin 
20 represent the fetal DNA. The paternal and maternal STRs may be identical in length or the 
maternal and the paternal STRs may differ. 

Heterozygous STRs are those of which the maternal and paternal differ in length. The 
amount of each PCR product can be quantitated for each heterozygous STR. With a normal number 
of chromosomes, the amount of each PCR product should be approximately equal. However, with 
25 an extra chromosome, one of the STR PCR products will be present at a greater amount. 

For example, multiple STRs on chromosome 1 can be analyzed on the template DNA 
obtained from the blood of the pregnant female. Each STR, whether of maternal or paternal origin, 
should be present at approximately the same amount. Likewise, with two chromosome 21s, each 
STR should be present at approximately the same amount. However, with a trisomy 21, one of the 
30 STR PCR products, when the maternal and paternal differ in length (a heterozygous STR) should be 
present at a higher amount. The ratio for each heterozygous STR on one chromosome can be 
compared to the ratio for each heterozygous STR on a different chromosome, wherein a difference 
indicates the presence or absence of a chromosomal abnormality. 
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Kits 

The methods of the invention are most conveniently practiced by providing the reagents 
used in the methods in the form of kits. A kit preferably contains one or more of the following 
components: written instructions for the use of the kit, appropriate buffers, salts, DNA extraction 
5 detergents, primers, nucleotides, labeled nucleotides, 5' end modification materials, and if desired, 
water of the appropriate purity, confined in separate containers or packages, such components 
allowing the user of the kit to extract the appropriate nucleic acid sample, and analyze the same 
according to the methods of the invention. The primers that are provided with the kit will vary, 
depending upon the purpose of the kit and the DNA that is desired to be tested using the kit. 

10 A kit can also be designed to detect a desired or variety of single nucleotide 

polymorphisms, especially those associated with an undesired condition or disease. For example, 
one kit can comprise, among other components, a set or sets of primers to amplify one or more loci 
of interest associated with Huntington's disease. Another kit can comprise, among other 
components, a set or sets of primers for genes associated with a predisposition to develop type I or 

15 type II diabetes. Still, another kit can comprise, among other components, a set or sets of primers 
for genes associated with a predisposition to develop heart disease. Details of utilities for such kits 
are provided in the "Utilities" section below. 
Utilities 

The methods of the invention can be used whenever it is desired to know the genotype of an 

20 individual. The method of the invention is especially useful for the detection of genetic disorders. 

The method of the invention is especially useful as a non-invasive technique for the detection of 

genetic disorders in a fetus. In a preferred embodiment, the method of the invention provides a 

method for identification of single nucleotide polymorphisms. 

In a preferred embodiment, the method is useful for detecting chromosomal abnormalities 

25 including but not limited to trisomies, monosomies, duplications, deletions, additions, chromosomal 

rearrangements, translocations, and other aneuploidies. The method is especially useful for the 

♦ 

detection of chromosomal abnormalities in a fetus. 

In a preferred embodiment, the method of the invention provides a method for 
identification of the presence of a disease in a fetus, especially a genetic disease that arises as a 

30 result of the presence of a genomic sequence, or other biological condition that it is desired to 
identify in an individual for which it is desired to know the same. The identification of such 
sequence in the fetus based on the presence of such genomic sequence can be used, for example, to 
determine if the fetus is a carrier or to assess if the fetus is predisposed to developing a certain 
genetic trait, condition or disease. The method of the invention is especially useful in prenatal 

35 genetic testing of parents and child. 
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Examples of diseases that can be diagnosed by this invention are listed in Table IV. 

TABLE IV 

Achondroplasia 

Adrenoleukodystrophy, X-Linked 
Agammaglobulinemia, X-Linked 
Alagille Syndrome 

Alpha-Thalassemia X-Linked Mental Retardation Syndrome 
Alzheimer Disease 

Alzheimer Disease, Early-Onset Familial 
Amyotrophic Lateral Sclerosis Overview 
Androgen Insensitivity Syndrome 
Angelman Syndrome 
Ataxia Overview, Hereditary 
Ataxia-Telangiectasia 

Becker Muscular Dystrophy also The Dystrophinopathies) 

Beckwith- Wiedemann Syndrome 

Beta-Thalassemia 

Biotinidase Deficiency 

Branchiootorenal Syndrome 

BRCA1 and BRCA2 Hereditary Breast/Ovarian Cancer 

Breast Cancer 

CADASDL 

Canavan Disease 

Cancer 

Charcot-Marie-Tooth Hereditary Neuropathy 
Charcot-Marie-Tooth Neuropathy Type 1 
Charcot-Marie-Tooth Neuropathy Type 2 
Charcot-Marie-Tooth Neuropathy Type 4 
Charcot-Marie-Tooth Neuropathy Type X 
Cockayne Syndrome * 
Colon Cancer 

Contractual Arachnodactyly, Congenital 
Craniosynostosis Syndromes (FGFR-Related) 
Cystic Fibrosis 
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Cystinosis 

Deafness and Hereditary Hearing Loss 
DRPLA (Dentatorubral-Pallidoluysian Atrophy) 
DiGeorge Syndrome (also 22ql 1 Deletion Syndrome) 
Dilated Cardiomyopathy, X-Linked 
Down Syndrome (Trisomy 21) 

Duchenne Muscular Dystrophy (also The Dystrophinopathies) 

Dystonia, Early-Onset Primary (DYT1) ~~ ~~ ~~ ~ 

Dystrophinopathies, The 

Ehlers-Danlos Syndrome, Kyphoscoliotic Form 

Ehlers-Danlos Syndrome, Vascular Type 

Epidermolysis Bullosa Simplex 

Exostoses, Hereditary Multiple 

Facioscapulohumeral Muscular Dystrophy 

Factor V Leiden Thrombophilia 

Familial Adenomatous Polyposis (FAP) 

Familial Mediterranean Fever 

Fragile X Syndrome 

Friedreich Ataxia 

Frontotemporal Dementia with Parkinsonism- 17 

Galactosemia 

Gaucher Disease 

Hemochromatosis, Hereditary 

Hemophilia A 

Hemophilia B 

Hemorrhagic Telangiectasia, Hereditary 

Hearing Loss and Deafness, Nonsyndromic, DFNA (Connexin 26) 

Hearing Loss and Deafness, Nonsyndromic, DFNB 1 (Connexin 26) 

Hereditary Spastic Paraplegia 

Hermansky-Pudlak Syndrome 

Hexosaminidase A Deficiency (also Tay-Sachs) 

Huntington Disease 

Hypochondroplasia 

Ichthyosis, Congenital, Autosomal Recessive 
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Incontinentia Pigmenti 

Kennedy Disease (also Spinal and Bulbar Muscular Atrophy) 
Krabbe Disease 

Leber Hereditary Optic Neuropathy 
Lesch-Nyhan Syndrome Leukemias 
Li-Fraumeni Syndrome 
Limb-Girdle Muscular Dystrophy 
Lipoprotein Lipase Deficiency, Familial 
Lissencephaly 
Marfan Syndrome 

MELAS (Mitochondrial Encephalomyopathy, Lactic Acidosis, and 

Stroke-Like Episodes) 

Monosomies 

Multiple Endocrine Neoplasia Type 2 

Multiple Exostoses, Hereditary Muscular Dystrophy, Congenital 
Myotonic Dystrophy • 
Nephrogenic Diabetes Insipidus 
Neurofibromatosis 1 
Neurofibromatosis 2 

Neuropathy with Liability to Pressure Palsies, Hereditary 

Niemann-Pick Disease Type C 

Nijmegen Breakage Syndrome Norrie Disease 

Oculocutaneous Albinism Type 1 

Oculopharyngeal Muscular Dystrophy 

Ovarian Cancer 

Pallister-Hall Syndrome 

Parkin Type of Juvenile Parkinson Disease 

Pelizaeus-Merzbacher Disease 

Pendred Syndrome 

Peutz- Jeghers Syndrome Phenylalanine Hydroxylase Deficiency 
Prader-Willi Syndrome 

PROP 1 -Related Combined Pituitary Hormone Deficiency (CPHD) 
Prostate Cancer 
Retinitis Pigmentosa 
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Retinoblastoma 

Rothmund-Thomson Syndrome 
Smith-Lemli-Opitz Syndrome 
Spastic Paraplegia, Hereditary 

Spinal and Bulbar Muscular Atrophy (also Kennedy Disease) 

Spinal Muscular Atrophy 

Spinocerebellar Ataxia Type 1 

Spinocerebellar Ataxia Type 2 

Spinocerebellar Ataxia Type 3 

Spinocerebellar Ataxia Type 6 

Spinocerebellar Ataxia Type 7 

Stickler Syndrome (Hereditary Arthroophthalmopathy) 

Tay-Sachs (also GM2 Gangliosidoses) r 

Trisomies 

Tuberous Sclerosis Complex 
Usher Syndrome Type I 
Usher Syndrome Type II 

Velocardiofacial Syndrome (also 22ql 1 Deletion Syndrome) 
Von Hippel-Lindau Syndrome 
Williams Syndrome 
Wilson Disease 

X-Linked Adrenoleukodystrophy 
X-Linked Agammaglobulinemia 

X-Linked Dilated Cardiomyopathy (also The Dystrophinopathies) 
X-Linked Hypotonic Facies Mental Retardation Syndrome 



The method of the invention is useful for screening an individual at multiple loci of interest, 
such as tens, hundreds, or even thousands of loci of interest associated with a genetic trait or genetic 
disease by sequencing the loci of interest that are associated with the trait or disease state, especially 
those most frequently associated with such trait or condition. The invention is useful for analyzing 
a particular set of diseases including but not limited to heart disease, cancer, endocrine disorders, 
immune disorders, neurological disorders, musculoskeletal disorders, ophthalmologic disorders, 
genetic abnormalities, trisomies, monosomies, transversions, translocations, skin disorders, and 
familial diseases. 
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The method of the invention can also be used to confirm or identify the relationship of a 
DNA of unknown sequence to a DNA of known origin or sequence, for example, for use in, 
maternity or paternity testing, and the like. 

Having now generally described the invention, the same will become better understood by 
reference to certain specific examples which are included herein for purposes of illustration only 
and are not intended to be limiting unless other wise specified. 

EXAMPLES 

The following examples are illustrative only and are not intended to limit the scope of the 
invention as defined by the claims. 

EXAMPLE 1 

DNA sequences were amplified by PCR, wherein the annealing step in cycle 1 was 
performed at a specified temperature, and then increased in cycle 2, and further increased in cycle 3 
for the purpose of reducing non-specific amplification. The TM1 of cycle 1 of PCR was 
determined by calculating the melting temperature of the 3' region, which anneals to the template 
DNA, of the second primer. For example, in FIG. IB, the TM1 can be about the melting 
temperature of region "c." The annealing temperature was raised in cycle 2, to TM2, which was 
about the melting temperature of the 3' region, which anneals to the template DNA, of the first 
primer. For example, in FIG. 1C, the annealing temperature (TM2) corresponds to the melting 
temperature of region "b." In cycle 3, the annealing temperature was raised to TM3, which was 
about the melting temperature of the entire sequence of the second primer. 'For example, in FIG. 
ID, the annealing temperature (TM3) corresponds to the melting temperature of region "c" + region 
"d". The remaining cycles of amplification were performed at TM3. 
Preparation of Template DNA 

The template DNA was prepared from a 5 ml sample of blood obtained by venipuncture 
from a human volunteer with informed consent. Blood was collected from 36 volunteers. 
Template DNA was isolated from each blood sample using QIAamp DNA Blood Midi Kit supplied 
by QIAGEN (Catalog number 5 1 1 83). Following isolation, the template DNA from each of the 36 
volunteers was pooled for further analysis. 
Primer Design 

The following four single nucleotide polymorphisms were analyzed: SNP HC21S00340, 
identification number as assigned by Human Chromosome 21 cSNP Database, (FIG. 3, lane 1) 
located on chromosome 21; SNP TSC 0095512 (FIG. 3, lane 2) located on chromosome 1, SNP 
TSC 0214366 (FIG. 3, lane 3) located on chromosome 1; and SNP TSC 0087315 (FIG. 3,lane4) 
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located on chromosome 1 . The SNP Consortium Ltd database can be accessed at 
http://snp.cshl.org/, website address effective as of February 14, 2002. 
SNP HC21S00340 was amplified using the following primers: 
First primer: 

5 TAGAATAGCACTGAATTCAGGAATACAATCATTGTCAC 3' 
Second primer: 

5 * ATC ACGATAAACGGCCAAACTCAGGTTA3 ' 

SNP TSC0095512 was amplified using the following primers: 
First primer: 

5 ' AAGTTTAGATCAGAATTCGTGAAAGCAGAAGTTGTCTG 3' 

Second primer: 
5 TCTCCAACTAACGGCTC ATCGAGTAAAG 3' 

SNP TSC0214366 was amplified using the following primers: 
First primer: 

5 * ATGACTAGCTATGAATTCGTTCAAGGTAGAAAATGGAA 3' 

Second primer: 
5 'GAGAATTAGAACGGCCC AAATCCCACTC3 ' 

SNP TSC 00873 15 was amplified using the following primers: 
First primer: 

5 'TTACAATGC ATGAATTC ATCTTGGTCTCTC AAAGTGC 3' 

Second primer: 
5 TGGACC ATAAACGGCCAAAAACTGTAAG 3'. 

All primers were designed such that the 3' region was complementary to either the 
upstream or downstream sequence flanking each locus of interest and the 5' region contained a 
restriction enzyme recognition site. The first primer contained a biotin tag at the 5' end and a 
recognition site for the restriction enzyme EcoRL The second primer contained the recognition site 
for the restriction enzyme BceA I. 
PCR Reaction 

All four loci of interest were amplified from the template genomic DNA using PCR (U.S. 
Patent Nos. 4,683,195 and 4,683,202). The components of the PCR reaction were as follows: 40 ng 
of template DNA, 5 jiM first primer, 5 nM second primer, 1 X HotStarTaq Master Mix as obtained 
from Qiagen (Catalog No. 203443). The HotStarTaq Master Mix contained DNA polymerase, PCR 
buffer, 200 (oM of each dNTP, and 1.5 mM MgCl 2 . 

Amplification of each template DNA that contained the SNP of interest was performed 
using three different series of annealing temperatures, herein referred to as low stringency annealing 
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temperature, medium stringency annealing temperature, and high stringency annealing temperature. 
Regardless of the annealing temperature protocol, each PCR reaction consisted of 40 cycles of 
amplification. PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by 
QIAGEN. As instructed by the manufacturer, the reactions were incubated at 95°C for 15 min. 
5 prior to the first cycle of PCR. The denaturation step after each extension step was performed at 
95°C for 30 sec. The annealing reaction was performed at a temperature that permitted efficient 
extension without any increase in temperature. 

The low stringency annealing reaction comprised three different annealing temperatures in. 
each of the first three cycles. The annealing temperature for the first cycle was 37°C for 30 sec; 
10 the annealing temperature for the second cycle was 57°C for 30 sec; the annealing temperature for 
the third cycle was 64°C for 30 sec. Annealing was performed at 64°C for subsequent cycles until 
completion. 

As shown in the photograph of the gel (FIG. 3 A), multiple bands were observed after 
amplification ofSNPTSC 0087315 (lane 4). Amplification of SNPHC21S00340 (lane 1),SNP 

15 TSC0095512 (lane 2), and SNP TSC0214366 (lane 3) generated a single band of high intensity and 
one band of faint intensity, which was of higher molecular weight. When the low annealing 
temperature conditions were used, the correct size product was generated and this was the 
predominant product in each reaction. 

The medium stringency annealing reaction comprised three different annealing 

20 temperatures in each of the first three cycles. The annealing temperature for the first cycle was 
40°C for 30 seconds; the annealing temperature for the second cycle was 60°C for 30 seconds; and 
the annealing temperature for the third cycle was 67°C for 30 seconds. Annealing was performed at 
67°C for subsequent cycles until completion. Similar to what was observed under low stringency 
annealing conditions, amplification of SNP TSC0087315 (FIG. 3B, lane 4) generated multiple 

25 bands under conditions of medium stringency. Amplification of the other three SNPs (lanes 1-3) 
produced a single band. These results demonstrate that variable annealing temperatures can be used 
to cleanly amplify loci of interest from genomic DNA with a primer that has an annealing length of 
13 bases. 

The high stringency annealing reaction was comprised of three different annealing 
30 temperatures in each of the first three cycles. The annealing temperature of the first cycle was 46°C 
for 30 seconds; the annealing temperature of the second cycle was 65°C for 30 seconds; and the 
annealing temperature for the third cycle was 72°C for 30 seconds. Annealing was performed at 
72°C for subsequent cycles until completion. As shown in the photograph of the gel (FIG. 3C), 
amplification of SNP TSC0087315 (lane 4) using the high stringency annealing temperatures 
35 generated a single band of the correct molecular weight By raising the annealing temperatures for 
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each of the first three cycles, non-specific amplification was eliminated. Amplification of SNP 

TSC0095512 (lane 2) generated a single band. SNPs HC21S00340 (lane 1), and TSC0214366 (lane 

3) failed to amplify at the high stringency annealing temperatures, however, at the medium 

stringency annealing temperatures, these SNPs amplified as a single band. These results 

5 demonstrate that variable annealing temperatures can be used to reduce non-specific PCR products, 

* 

as demonstrated for SNP TSC00873 15 (FIG. 3, lane 4). 

EXAMPLE 2 

SNPs on chromosomes 1 (TSC0095512), 13 (TSC0264580), and 21 (HC21S00027) were 

analyzed. SNP TSC0095512 was analyzed using two different sets of primers, and SNP 
1 0 HC21 S00027 was analyzed using two types of reactions for the incorporation of nucleotides. 

Preparation of Template DNA 

The template DNA was prepared from a 5 ml sample of blood obtained by venipuncture 

from a human volunteer with informed consent. Template DNA was isolated using the QIAmp 

DNA Blood Midi Kit supplied by QIAGEN (Catalog number 511 83). The template DNA was 
1 5 isolated as per instructions included in the kit. Following isolation, template DNA from thirty-six 

human volunteers were pooled together and cut with the restriction enzyme EcoRI. The restriction 

enzyme digestion was performed as per manufacturer's instructions. 

Primer Design 

SNP HC21 S00027 was amplified by PCR using the following primer set: 
20 First primer: 

5' ATAACCGTATGCGAATTCTATAATTTTCCTGATAAAGG 3' 

Second primer: 
5* CTTAAATCAGGGGACTAGGTAAACTTCA 3'. 

The first primer contained a biotin tag at the extreme 5' end, and the nucleotide sequence 
25 for the restriction enzyme EcoRI. The second primer contained the nucleotide sequence for the 
restriction enzyme BsmF I (FIG. 4A). 

Also, SNP HC21S00027 was amplified by PCR using the same first primer but a different 
second primer with the following sequence: 
Second primer: 
30 5' CTTAAATCAGACGGCTAGGTAAACTTCA 3' 

This second primer contained the recognition site for the restriction enzyme BceA I (FIG. 

4B). 

SNP TSC0095512 was amplified by PCR using the following primers: 
First primer: 

35 5' AAGTTTAGATCAGAATTCGTGAAAGCAGAAGTTGTCTG 3' 
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Second primer: 
5' TCTCCAACTAGGGACTCATCGAGTAAAG 3\ 

The first primer had a biotin tag at the 5' end and contained a restriction enzyme 
recognition site for EcoRI. The second primer contained a restriction enzyme recognition site for 
5 BsmF I (FIG 4C). 

Also, SNP TSCO095512 was amplified using the same first primer and a different second 
primer with the following sequence: 

Second primer: 
5 'TCTCCAACTAACGGCTCATCGAGTAAAG3 ' 
10 This second primer contained the recognition site for the restriction enzyme BceA I (FIG. 

4D). 

SNP TSC0264580, which is located on chromosome 13, was amplified with the following 

primers: 

First primer: 

15 5' AACGCCGGGCGAGAATTCAGTTTTTCAACTTGCAAGG 3' 
Second primer: 
5' CTACACATATCTGGGACGTTGGCCATCC 3'. 

The first primer contained a biotin tag at the extreme 5' end and had a restriction enzyme 
recognition site for EcoRI. The second primer contained a restriction enzyme recognition site for 
20 BsmF I. 

PCR Reaction 

All loci of interest were amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
In this example, the loci of interest were amplified in separate reaction tubes but they could also be 

25 amplified together in a single PCR reaction. For increased specificity, a "hot-start" PCR was used. 
PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
number 203443). The amount of template DNA and primer per reaction can be optimized for each 
locus of interest but in this example, 40 ng of template human genomic DNA and 5 |iM of each 
primer were used. Forty cycles of PCR were performed. The following PCR conditions were used: 

30 (1) 95°C for 15 minutes and 15 seconds; 

(2) 37°C for 30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°C for 30 seconds; 
3 5 (6) 64°C for 30 seconds; 
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(7) 95°C for 30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature was about the melting temperature of 
the 3' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3* region, which anneals to the 
template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. The PCR products for 
SNP HC21S00027 and SNP TSC095512 are shown in FIGS. 5A-5D. 
Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. Each PCR product 
was divided into four separate reaction wells of a Streptawell, transparent, High-Bind plate from 
Roche Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 
2001 Biochemicals Catalog). The first primers contained a 5' biotin tag so the PCR products bound 
to the Streptavidin coated wells while the genomic template DNA did not. The streptavidin binding 
reaction was performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each 
well was aspirated to remove unbound material, and washed three times with IX PBS, with gentle 
mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., Biotechniques 
10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 
Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme that bound the 
recognition site incorporated into the PCR products from the second primer. SNP HC21S00027 
(FIG. 6A and 6B) and SNP TSC0095512 (FIG. 6C and 6D) were amplified in separate reactions 
using two different second primers. FIG. 6A(SNPHC21S00027)andFIG. 6C(SNP 
TSC0095512) depict the PCR products after digestion with the restriction enzyme BsmF I (New 
England Biolabs catalog number R0572S). FIG. 6B (SNPHC21S00027)andFIG. 6D(SNP 
TSC0095512) depict the PCR products after digestion with the restriction enzyme BceA I (New 
England Biolabs, catalog number R0623 S). The digests were performed in the Streptawells 
following the instructions supplied with the restriction enzyme. SNP TSC0i64580 was digested 
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with BsmF I. After digestion with the appropriate restriction enzyme, the wells were washed three 
times with PBS to remove the cleaved fragments. 
Incorporation of Labeled Nucleotide 

The restriction enzyme digest described above yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3 ' recessed end. The 5 9 overhang 
functioned as a template allowing incorporation of a nucleotide or nucleotides in the presence of a 
DNA polymerase. 

For each SNP, four separate fill in reactions were performed; each of the four reactions 
contained a different fluorescently labeled dideoxynucleotide (ddATP, ddCTP, ddGTP, or ddTTP). 
The following components were added to each fill in reaction: 1 ^1 of a fluorescently labeled 
dideoxynucleotide, 0.5 jil of unlabeled ddNTPs ( 40 (iM), which contained all nucleotides except 
the nucleotide that was fluorescently labeled, 2 nl of 10X sequenase buffer, 0.25 \xl of Sequenase, 
and water as needed for a 20 \il reaction. All of the fill in reactions were performed at 40°C for 10 
min. Non-fluorescently labeled nucleotides was purchased from Fermentas Inc. (Hanover, MD). 
All other labeling reagents were obtained from Amersham (Thermo Sequenase Dye Terminator 
Cycle Sequencing Core Kit, US 79565). In the presence of fluorescently labeled ddNTPs, the 3 * 
recessed end was extended by one base, which corresponds to the SNP or locus of interest (FIG 
7A-7D). 

A mixture of labeled ddNTPs and unlabeled dNTPs also was used for the "fill in" reaction 
for SNP HC21 S00027. The "fill in" conditions were as described above except that a mixture 
containing 40 |jM unlabeled dNTPs, 1 pJ fluorescently labeled ddATP, 1 \xl fluorescently labeled 
ddCTP, 1 jal fluorescently labeled ddGTP, and 1 jil ddTTP was used. The fluorescent ddNTPs 
were obtained from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, 
US 79565; Amersham did not publish the concentrations of the fluorescent nucleotides). SNP 
HC21S00027 was digested with the restriction en2yme BsmF I, which generated a 5' overhang of 
four bases. As shown in FIG. 7E, if the first nucleotide incorporated is a labeled 
dideoxynucleotide, the 3' recessed end is filled in by one base, allowing detection of the SNP or 
locus of interest. However, if the first nucleotide incorporated is a dNTP, the polymerase continues 
to incorporate nucleotides until a ddNTP is filled in. For example, the first two nucleotides can be 
filled in with dNTPs, and the third nucleotide with a ddNTP, allowing detection of the third 
nucleotide in the overhang. Thus, the sequence of the entire 5' overhang can be determined, which 
increases the information obtained from each SNP or locus of interest. 

After labeling, each Streptawell was rinsed with IX PBS (100 jxl) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
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enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme 
(FIGS. 8A-8D). Digestion was performed for 1 hour at 37°C with shaking at 120 rpm. 
Detection of the Locus of Interest 

After release from the streptavidin matrix, 2-3 |il of the 10 pi sample was loaded in a 48 
5 well membrane tray (The Gel Company, catalog number TAM48-01). The sample in the tray was 
absorbed with a 48 Flow Membrane Comb (The Gel Company, catalog number AM48), and 
inserted into a 36 cm 5% acrylamide (urea) gel (BioWhittaker Molecular Applications, Long 
Ranger Run Gel Packs, catalog number 50691). 

The sample was electrophoresed into the gel at 3000 volts for 3 min. The membrane comb 
10 was removed, and the gel was run for 3 hours on an ABI 377 Automated Sequencing Machine. The 
incorporated labeled nucleotide was detected by fluorescence. 

As shown in FIG. 9A, from a sample of thirty six (36) individuals, one of two nucleotides, 
either adenosine or guanine, was detected at SNP HC21 S00027. These are the two nucleotides 
reported to exist at SNP HC21S00027 (http://snp.cshl.org/snpsearch.shtml). 
15 One of two nucleotides, either guanine or cytosine, was detected at SNP TS000955 12 (FIG. 

9B). The same results were obtained whether the locus of interest was amplified with a second 
primer that contained a recognition site for BceA I or the second primer contained a recognition site 
forBsmFI. 

As shown in FIG. 9C, one of two nucleotides was detected at SNP TSC0264580, which 
20 was either adenosine or cytosine. These are the two nucleotides reported for this SNP site 

(http://snp.cshl.org/snpsearch.shtml). In addition, a thymidine was detected one base from the locus 
of interest. In a sequence dependent manner, BsmF I cuts some DNA molecules at the 10/14 
position and other DNA molecules, which have the same sequence, at the 1 1/15 position. When the 
restriction enzyme BsmF I cuts 1 1 nucleotides away on the sense strand and 15 nucleotides away on 
25 the antisense strand, the 3' recessed end is one base from the SNP site. The sequence of SNP 
TSC0264580 indicated that the base immediately preceding the SNP site was a thymidine. The 
incorporation of a labeled ddNTP into this position generated a fragment one base smaller than the 
fragment that was cut at the 10/14 position. Thus, the DNA molecules cut at the 1 1/15 position 
provided sequence information about the base immediately preceding the SNP site, and the DNA 
30 molecules cut at the 10/14 position provided sequence information about the SNP site. 

SNP HC21 S00027 was amplified using a second primer that contained the recognition site 
for BsmF I. A mixture of labeled ddNTPs and unlabeled dNTPs was used to fill in the 5* overhang 
generated by digestion with BsmF I. If a dNTP was incorporated, the polymerase continued to 
incorporate nucleotides until a ddNTP was incorporated. A population of DNA fragments, each 
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differing by one base, was generated, which allowed the fall sequence of the overhang to be 
determined. 

As seen in FIG. 9D, an adenosine was detected, which was complementary to the 
nucleotide (a thymidine) immediately preceding the SNP or locus of interest. This nucleotide was 
5 detected because of the 1 1/15 cutting property of BsmF I, which is described in detail above. A 
guanine and an adenosine were detected at the SNP site, which are the two nucleotides reported for 
this SNP site (FIG. 9A). The two nucleotides were detected at the SNP site because the molecular 
weights of the dyes differ, which allowed separation of the two nucleotides. The next nucleotide 
detected was a thymidine, which is complementary to the nucleotide immediately downstream of 

10 the SNP site. The next nucleotide detected was a guanine, which was complementary to the 

nucleotide two bases downstream of the SNP site. Finally, an adenosine was detected, which was 
complementary to the third nucleotide downstream of the SNP site. Sequence information was 
obtained not only for the SNP site but for the nucleotide immediately preceding the SNP site and 
the next three nucleotides. 

1 5 None of the loci of interest contained a mutation. However, if one of the loci of interest 

harbored a mutation including but not limited to a point mutation, insertion, deletion, translocation 
or any combination of said mutations, it could be identified by comparison to the consensus or 
published sequence. Comparison of the sequences attributed to each of the loci of interest to the 
native, non-disease related sequence of the gene at each locus of interest determines the presence or 

20 absence of a mutation in that sequence. The finding of a mutation in the sequence is then 

interpreted as the presence of the indicated disease, or a predisposition to d6velop the same, as 
appropriate, in that individual. The relative amounts of the mutated vs. normal or non-mutated 
sequence can be assessed to determine if the subject has one or two alleles of the mutated sequence, 
and thus whether the subject is a carrier, or whether the indicated mutation results in a dominant or 

25 recessive condition. 

EXAMPLE 3 

Four loci of interest from chromosome 1 and two loci of interest from chromosome 21 were 
amplified in separate PCR reactions, pooled together, and analyzed. The primers were designed so 
that each amplified locus of interest was a different size, which allowed detection of the loci of 
30 interest. 

Preparation of Template DNA 

The template DNA was prepared from a 5 ml sample of blood obtained by venipuncture 
from a human volunteer with informed consent. Template DNA was isolated using the QIAmp 
DNA Blood Midi Kit supplied by QIAGEN (Catalog number 5 1 1 83). The template DNA was 
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isolated as per instructions included in the kit. Template DNA was isolated from thirty-six human 
volunteers, and then pooled into a single sample for further analysis. 
Primer Design 

SNP TSC 00873 15 was amplified using the following primers: 
5 First primer: 

5 'TTACAATGCATGAATTCATCTTGGTCTCTCAAAGTGC 3' 

Second primer: 
5TGGACCATAAACGGCCAAAAACTGTAAG3 \ 

SNP TSC0214366 was amplified using the following primers: 
10 First primer: • 

5'ATGACTAGCTATGAATTCGTTCAAGGTAGAAAATGGAA 3' 

Second primer: 
S'GAGAATTAGAACGGCCCAAATCCCACTC 3' 

SNP TSC 0413944 was amplified with the following primers: 
15 First primer: 

5' TACCTTTTGATCGAATTCAAGGCC AAAAATATTAAGTT 3 1 

Second primer: 
5' TCGAACTTTAACGGCCTTAGAGTAGAGA 3' 

SNP TSC0095512 was amplified using the following primers: 
20 First primer: 

5 ' AAGTTTAGATCAGAATTCGTGAAAGCAGAAGTTGTCTG 3' 

Second primer: 
5TCTCCAACTAACGGCTCATCGAGTAAAG 3' 

SNP HC21S0013 1 was amplified with the following primers: 
25 First primer: 

5' CGATTTCGATAAGAATTCAAAAGCAGTTCTTAGTTCAG 3' 

Second primer: 
5 TGCGAATCTTACGGCTGCATCAC ATTC A 3' 

SNP HC21S00027 was amplified with the following primers: 
30 First primer: 

5' ATAACCGTATGCGAATTCTATAATTTTCCTGATAAAGG 3' 

Second primer: 
5' CTTAAATCAGACGGCTAGGTAAACTTCA 3' 
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For each SNP, the first primer contained a recognition site for the restriction enzyme EcoRI 
and had a biotin tag at the extreme 5' end. The second primer used to amplify each SNP contained 
a recognition site for the restriction enzyme BceA I. 
PCR Reaction 

The PCR reactions were performed as described in Example 2 except that the following 
annealing temperatures were used: the annealing temperature for the first cycle of PCR was 37°C 
for 30 seconds, the annealing temperature for the second cycle of PCR was 57°C for 30 seconds, 
and the annealing temperature for the third cycle of PCR was 64°C for 30 seconds. All subsequent 
cycles had an annealing temperature of 64°C for 30 seconds. Thirty seven (37) cycles of PCR were 
performed. After PCR, 1/4 of the volume was removed from each reaction, and combined into a 
single tube. 

Purification of Fragment of Interest 

The PCR products (now combined into one sample, and referred to as "the sample") were 
separated from the genomic template DNA as described in Example 2 except that the sample was 
bound to a single well of a Streptawell microtiter plate. 
Restriction Enzyme Digestion of Isolated Fragments 

The sample was digested with the restriction enzyme BceA I, which bound the recognition 
site in the second primer. The restriction enzyme digestions were performed following the 
instructions supplied with the enzyme. After the restriction enzyme digest, the wells were washed 
three times with IX PBS. 
Incorporation of Nucleotides 

The restriction enzyme digest described above yielded DNA molecules with a 5 ? overhang, 
which contained the SNP site or locus of interest and a 3' recessed end. The 5' overhang 
functioned as a template allowing incorporation of a nucleotide in the presence of a DNA 
polymerase. 

The following components were used for the fill in reaction: 1 fxl of fluorescently labeled 
ddATP; 1 jil of fluorescently labeled ddTTP; 1 yl of fluorescently labeled ddGTP; 1 jLtl of 
fluorescently labeled ddCTP; 2 jwl of 10X sequenase buffer, 0.25 jul of Sequenase, and water as 
needed for a 20 yd reaction. The fill in reaction was performed at 40°C for 10 min. All labeling 
reagents were obtained from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing 
Core Kit (US 79565); the concentration of the ddNTPS provided in the kit is proprietary and not 
published by Amersham). In the presence of fluorescently labeled ddNTPs, the 3' recessed end was 
filled in by one base, which corresponds to the SNP or locus of interest • 

After the incorporation of nucleotide, the Streptawell was rinsed with IX PBS (100 ^1) 
three times. The "filled in" DNA fragments were then released from the Streptawell by digestion 
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with the restriction enzyme EcoRI following the manufacturer's instructions. Digestion was 
performed for 1 hour at 37 °C with shaking at 120 rpm. 
Detection of the Locus of Interest 

After release from the streptavidin matrix, 2-3 \xl of the 10 \i\ sample was loaded in a 48 
5 well membrane tray (The Gel Company, catalog number TAM48-01). The sample in the tray was 
absorbed with a 48 Flow Membrane Comb (The Gel Company, catalog number AM48), and 
inserted into a 36 cm 5% acrylamide (urea) gel (BioWhittaker Molecular Applications, Long 
Ranger Run Gel Packs, catalog number 50691). 

The sample was electrophoresed into the gel at 3000 volts for 3 min. The membrane comb 
10 was removed, and the gel was run for 3 hours on an ABI 377 Automated Sequencing Machine. The 
incorporated nucleotide was detected by fluorescence. 

The primers were designed so that each amplified locus of interest differed in size. As 
shown in FIG. 10, each amplified loci of interest differed by about 5-10 nucleotides, which allowed 
the loci of interest to be separated from one another by gel electrophoresis. Two nucleotides were 
1 5 detected for SNP TSC00873 1 5, which were guanine and cytosine. These are the two nucleotides 
reported to exist at SNP TSC0087315 (http://snp.cshl.org/snpsearch.shtml). The sample comprised 
template DNA from 36 individuals and because the DNA molecules that incorporated a guanine 
differed in molecular weight from those that incorporated a cytosine, distinct bands were seen for 
each nucleotide. 

20 Two nucleotides were detected at SNP HC21 S00027, which were guanine and adenosine 

(FIG. 10). The two nucleotides reported for this SNP site are guanine and adenosine 
(http://snp.cshl.org/snpsearch.shtml). As discussed above, the sample contained template DNA 
from thirty-six individuals, and one would expect both nucleotides to be represented in the sample. 
The molecular weight of the DNA fragments that incorporated a guanine was distinct from the 
25 DNA fragments that incorporated an adenosine, which allowed both nucleotides to be detected. 

The nucleotide cytosine was detected at SNP TSC0214366 (FIG. 10). The two nucleotides 
reported to exist at this SNP position are thymidine and cytosine. 

The nucleotide guanine was detected at SNP TSC04 13944 (FIG. 10). The two nucleotides 
reported for this SNP are guanine and cytosine (http://spp.cshl.org/snpsearch.shtml). 
30 The nucleotide cytosine was detected at SNP TS000955 12 (FIG. 1 0). The two nucleotides 

reported for this SNP site are guanine and cytosine (http://snp.cshl.org/snpsearch.shtml). 

The nucleotide detected at SNP HC21S00131 was guanine. The two nucleotides reported 
for this SNP site are guanine and adenosine (http://snp.cshl.org/snpsearch.shtml). 

As discussed above, the sample was comprised of DNA templates from thirty-six 
35 individuals and one would expect both nucleotides at the SNP sites to be represented. For SNP 
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TSC0413944, TSC00955 12, TSC0214366 and HC21S00131, one of the two nucleotides was 
detected. It is likely that both nucleotides reported for these SNP sites are present in the sample but 
that one fluorescent dye overwhelms the other. The molecular weight of the DNA molecules that 
incorporated one nucleotide did not allow efficient separation of the DNA molecules that 

5 incorporated the other nucleotide. However, the SNPs were readily separated from one another, 
and for each SNP, a proper nucleotide was incorporated. The sequences of multiple loci of interest 
from multiple chromosomes, which were treated as a single sample after PCR, were determined. 

A single reaction containing fluorescently labeled ddNTPs was performed with the sample 
that contained multiple loci of interest. Alternatively, four separate fill in reactions can be 

10 performed where each reaction contains one fluorescently labeled nucleotide (ddATP, ddTTP, 

ddGTP, or ddCTP) and unlabeled ddNTPs (see Example 2, FIGS. 7A-7D and FIGS. 9A-C). Four 
separate "fill in" reactions will allow detection of any nucleotide that is present at the loci of 
interest. For example, if analyzing a sample that contains multiple loci of interest from a single 
individual, and said individual is heterozygous at one or more than one loci of interest, four separate 

1 5 "fill in" reactions can be used to determine the nucleotides at the heterozygous loci of interest. 

Also, when analyzing a sample that contains templates from multiple individuals, four 
separate "fill in" reactions will allow detection of nucleotides present in the sample, independent of 
how frequent the nucleotide is found at the locus of interest. For example, if a sample contains 
DNA templates from 50 individuals, and 49 of the individuals have a thymidine at the locus of 

20 interest, and one individual has a guanine, the performance of four separate "fill in" reactions, 

wherein each "fill in" reaction is run in a separate lane of a gel, such as in FIGS. 9A-9C, will allow 
detection of the guanine. When analyzing a sample comprised of multiple DNA templates, multiple 
"fill in" reactions will alleviate the need to distinguish multiple nucleotides at a single site of 
interest by differences in mass. 

25 In this example, multiple single nucleotide polymorphisms were analyzed. It is also 

possible to determine the presence or absence of mutations, including but not limited to point 
mutations, transitions, transversions, translocations, insertions, and deletion from multiple loci of 
interest. The multiple loci of interest can be from a single chromosome or from multiple 
chromosomes. The multiple loci of interest can be from a single gene or from multiple genes. 

30 The sequence of multiple loci of interest that cause or predispose to a disease phenotype 

can be determined. For example, one could amplify one to tens to hundreds to thousands of genes 
implicated in cancer or any other disease. The primers can be designed so that each amplified loci 
of interest differs in size. After PCR, the amplified loci of interest can be combined and treated as a 
single sample. Alternatively, the multiple loci of interest can be amplified in one PCR reaction or 

35 the total number of loci of interest, for example 100, can be divided into samples, for example 10 



110 



WO 2004/078994 



PCT7US2004/006337 



loci of interest per PCR reaction, and then later pooled. As demonstrated herein, the sequence of 
multiple loci of interest can be determined. Thus, in one reaction, the sequence of one to ten to 
hundreds to thousands of genes that predispose or cause a disease phenotype can be determined. 

EXAMPLE 4 

The ability to determine the sequence or detect chromosomal abnormalities of a fetus using 
free fetal DNA in a sample from a pregnant female has been hindered by the low percentage of free 
fetal DNA. Increasing the percentage of free fetal DNA would enhance the detection of mutation, 
insertion, deletion, translocation, transversion, monosomy, trisomy, trisomy 21, trisomy 18, trisomy 
13, XXY, XXX, other aneuoplodies, deletion, addition, amplification, translocation and 
rearrangement. The percent of fetal DNA in plasma obtained from a pregnant female was 
determined both in the absence and presence of inhibitors of cell lysis. A genetic marker on the Y 
chromosome was used to calculate the percent of fetal DNA. 
Preparation of Template DNA 

The DNA template was prepared from a 5 ml sample of blood obtained by venipuncture 
from a human volunteer with informed consent. The blood was aliquoted into two tubes (Fischer 
Scientific, 9 ml EDTA Vacuette tubes, catalog number NC9897284). Formaldehyde (25 ^il/ml of 
blood) was added to one of the tubes. The sample in the other tube remained untreated, except for 
the presence of the EDTA. The tubes were spun at 1000 rpm for ten minutes. Two milliliters of the 
supernatant (the plasma) of each sample was transferred to a new tube and spun at 3000 rpm for ten 
minutes. 800 |uil of each sample was used for DNA purification. DNA was isolated using the 
Qiagen Midi Kit for purification of DNA from blood cells (QIAmp DNA Blood Midi Kit, Catalog 
number 51 183). DNA was eluted in 100 fxl of distilled water. Two DNA templates were obtained: 
one from the blood sample treated with EDTA, and one from the blood sample treated with EDTA 
and formaldehyde. 
Primer Design 

Two different sets of primers were used: one primer set was specific for the Y 
chromosome, and thus specific for fetal DNA, and the other primer set was designed to amplify the 
cystic fibrosis gene, which is present on both maternal template DNA and fetal template DNA. 

In this example, the first and second primers were designed so that the entire 5' and 3' 
sequence of each primer annealed to the template DNA. In this example, the fetus had an XY 
genotype, and the Y chromosome was used as a marker for the presence of fetal DNA. The 
following primers were designed to amplify the SRY gene on the Y chromosome. 

First primer: 
5' TGGCGATTAAGTCAAATTCGC 3* 

Second primer: 
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5 CCCCCTAGTACCCTGACAATGTATT 3' 

Primers designed to amplify any gene, or region of a region, or any part of any 
chromosome could be used to detect maternal and fetal DNA. In this example, the following 
primers were designed to amplify the cystic fibrosis gene: 
5 First primer: 

5' GTGTTCTGTGATATTATGTGTGGT 3' 

Second primer: 
5' AATTGTTGGCATTCCAGCATTG 3' 
PCR Reaction 

10 The SRY gene and the cystic fibrosis gene were amplified from the template genomic DNA 

using PCR (U.S. Patent Nos. 4,683,195 and 4,683,202). For increased specificity, a "hot-start" 
PCR was used. PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by 
Qiagen (Catalog No. 203443). For amplification of the SRY gene, the DNA eluted from the 
Qiagen purification column was diluted serially 1 :2. For amplification of the cystic fibrosis gene, 

15 the DNA from the Qiagen purification column was diluted 1 :4, and then serially diluted 1 :2. The 
following components were used for each PCR reaction: 8 \il of template DNA (diluted or 
undiluted), 1 jal of each primer (5 pM), 10 jxl of HotStar Taq mix. The following PCR conditions 
were used: 

(1) 950Cforl5' 
20 (2)94°Cforl' 

(3) 54°C for 15" 

(4) 72°Cfor30" 

(5) Repeat steps 2-4 for 45 cycles. 

(6) 10' at 72°C 
25 Quantification of Fetal DNA 

The DNA templates that were eluted from the Qiagen columns weife serially diluted to the 
following concentrations: 1:2, 1:4, 1:8, 1:16, 1:32, 1:64, 1:128, 1:256, 1:512, 1:1024, 1:2048, and 
1:4096. Amplification of the SRY gene was performed using the templates that were undiluted, 
1:2, 1:4, 1:8, 1:16, 1:32, 1:64, 1:128, 1:256, 1:512. Amplification of the cystic fibrosis gene was 

30 performed using the DNA templates that were diluted 1:4, 1:8, 1:16, 1:32, 1:64, 1:128, 1:256, 
1:512, 1:1024, 1:2048, and 1:4096. The same dilution series was performed with the DNA 
templates that were purified from the plasma sample treated with EDTA alone and the plasma 
sample treated with EDTA and formaldehyde. 

The results of the PCR reactions using the DNA template that was isolated from the plasma 

35 sample treated with EDTA are shown in FIG. 11A. The SRY gene was amplified from the 

J 
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undiluted DNA template, and also in the sample that was diluted 1 :2 (FIG. 1 1 A). The SRY gene 
was not amplified in the next seven serial dilutions. On the other hand, the cystic fibrosis gene was 
detected in the serial dilutions up to 1 :256. A greater presence of the cystic fibrosis gene was 
expected because of the higher percentage of maternal DNA present in the plasma. The last 
dilution sample that provided for amplification of the gene product was assumed to have one copy 
of the cystic fibrosis gene or the SRY gene. 

The results of the PCR reactions using the DNA template that was isolated from the plasma 
sample treated with formaldehyde and EDTA are shown in FIG. 1 IB. The SRY gene was 
amplified from the undiluted DNA template, and also in the sample that was diluted 1:2 (FIG. 
1 IB). The SRY gene was not amplified in the next six dilutions. However, in the 1 :256 dilution, 
the SRY gene was detected. It is unlikely that the amplification in the 1 :256 sample represents a 
real signal because the prior six dilution series were all negative for amplification of SRY. 
Amplification of the SRY gene in this sample was likely an experimental artifact resulting from the 
high number of PCR cycles used. Thus, the 1 :256 sample was not used in calculating the amount of 
fetal DNA present in the sample. 

Amplification of the cystic fibrosis gene was detected in the sample that was diluted 1:16 
(FIG. 11B). The presence of the formalin prevents maternal cell lysis, and thus, there is a lower 
percentage of maternal DNA in the sample. This is in strong contrast to the sample that was treated 
with only EDTA, which supported amplification up to a dilution of 1:256. 

The percent of fetal DNA present in the maternal plasma was calculated using the 
following formula: 

% fetal DNA = (amount of SRY gene/amount of cystic fibrosis gene)*2 ,:i 100. 
The amount of SRY gene was represented by the highest dilution value in which the gene was 
amplified. Likewise, the amount of cystic fibrosis gene was represented by the highest dilution 
value in which it was amplified. The formula contains a multiplication factor of two (2), which is 
used to normalize for the fact that there is only one copy of the SRY gene (located on the Y 
chromosome), while there are two copies of the cystic fibrosis gene. 

For the above example, the percentage of fetal DNA present in the sample that was treated 
with only EDTA was 1.56 % (2/256 * 2 *100). The reported percentage of fetal DNA present in 
the plasma is between 0.39-1 1.9 % (Pertl and Bianchi, Obstetrics and Gynecology, Vol PS, No. 3, 
483-490 (2001). The percentage of fetal DNA present in the sample treated with formalin and 
EDTA was 25% (2/16 * 2 * 100). The experiment was repeated numerous times, and each time the 
presence of formalin increased the overall percentage of fetal DNA. 

The percent fetal DNA from eighteen blood samples with and without formalin was 
calculated as described above with the exception that serial dilutions of 1:5 were performed. As 1:5 
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dilutions were performed, the last serial dilution that allowed detection of either the SRY gene or 
the cystic fibrosis gene may have had one copy of the gene or it may have had 4 copies of the gene. 
The results from the eighteen samples with and without formalin are summarized in Table V. The 
low range assumes that the last dilution sample had one copy of the genes and the high range 
assumes that the last dilution had four copies of the genes. 



Table V. Mean Percentage Fetal DNA with and without formalin. 



Sample 


Lower Range 


Upper Range 


Formalin 


19.47 


43.69 


Without Formalin 


7.71 


22.1 



An overall increase in fetal DNA was achieved by reducing the maternal cell lysis, and 
thus, reducing the amount of maternal DNA present in the sample. In this example, formaldehyde 
was used to prevent lysis of the cells, however any agent that prevents the lysis of cells or increases 
the structural integrity of the cells can be used. Two or more than two cell lysis inhibitors can be 
used. The increase in fetal DNA in the maternal plasma allows the sequence of the fetal DNA to be 
determined, and provides for the rapid (detection of abnormal DNA sequences or chromosomal 
abnormalities including but not limited to point mutation, reading frame shift, transition, 
transversion, addition, insertion, deletion, addition-deletion, frame-shift, missense, reverse 
mutation, and microsatellite alteration, trisomy, monosomy, other aneuploidies, amplification, 
rearrangement, translocation, transversion, deletion, addition, amplification, fragment, 
translocation, and rearrangement. 

EXAMPLES 

A DNA template from an individual with a genotype of trisomy 21 was analyzed. Three 
loci of interest were analyzed on chromosome 13 and two loci of interest were analyzed on 
chromosome 21. 

Preparation of Template DNA 

The template DNA was prepared from a 5 ml sample of blood obtained by venipuncture 
from a human volunteer with informed consent. The human volunteer had previously been 
genotyped to have an additional chromosome 21 (trisomy 21). Template DNA was isolated using 
QIAamp DNA Blood Midi Kit supplied by QIAGEN (Catalog number 5 1 1 83). 
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Primer Design 

The following five single nucleotide polymorphisms were analyzed: SNP TSC 01 15603 
located on chromosome 21; SNP TSC 03209610 located on chromosome 21; SNP TSC 0198557 
located on chromosome 13; and SNP TSC 0200347 located on chromosome 13. The DNA 
template from another individual was used as an internal control. The SNP TSC 0200347, which 
was previously identified as being homozygous for guanine, was used as the internal control. The 
SNP Consortium Ltd database can be accessed at http://snp.cshl.org/, website address effective as 
of April 1,2002. 

SNP TSC 01 15603 was amplified using the following primers: 
First Primer: 

5' GTGCACTTACGTGAATTCAGATGAACGTGATGTAGTAG 3' 

Second Primer: 
5' TCCTCGTACTCAACGGCTTTCTCTGAAT 3' 

The first primer was biotinylated at the 5 ' end, and contained the restriction enzyme 
recognition site for EcoR I. The second primer contained the restriction enzyme recognition site for 
the restriction enzyme BceA I. 

SNP TSC 0309610 was amplified using the following primers: 
First primer: 

5> TCCGGAACACTAGAATTCTTATTTACATACACACTTGT V 
Second primer: 

5 J CGAATAAGGTAGACGGCAACAATGAGAA 3' 

The first primer contained a biotin group at the 5' end, and a restriction enzyme recognition 
site for the restriction enzyme EcoR I. The second primer contained the restriction eiuzyme 
recognition site for BceA L 

Submitted SNP (ss) 813773 (accession number assigned by the NCBI Submitted SNP (ss) 
Database) was amplified with the following primers: 
First primer: 

S y CGGTAAATCGGAGAATTCAGAGGATTTAGAGGAGCTAA 3' 
Second primer: 

5' CTCACGTTCGTTACGGCCATTGTGATAGC 3» 

The first primer contains a biotin group at the 5' end, and a recognition site for the 
restriction enzyme EcoR I. The second primer contained the restriction enzyme recognition site for 
BceA I. 

SNP TSC 0198557 was amplified with the following primers: 
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5' GGGGAAACAGTAGAATTCCATATGGACAGAGCTGTACT 3' 

Second primer: 
5' TGAAGCTGTCGGACGGCCTTTGCCCTCTC 3' 
5 The first primer contains a biotin group at the 5 9 end, and a recognition site for the 

restriction enzyme EcoR I. The second primer contained the restriction enzyme recognition site for 
BceA I. 

SNP TSC 0197279 was amplified with the following primers: 
First primer: 

10 5' ATGGGCAGTTATGAATTCACTACTCCCTGTAGCTTGTT 3' 
Second primer: 
5' TGATTGGCGCGAACGGCACTCAGAGAAGA 3* 

The first primer contained a biotin group at the 5* end, and a recognition site for the 
restriction enzyme for EcoR I. The second primer contained the recognition site for the restriction 
15 enzyme BceA I. 

SNP TSC 0200347 was amplified with the following primers: 
First primer: 

5' CTCAAGGGGACCGAATTCGCTGGGGTCTTCTGTGGGTC 3' 
Second primer: 
20 5' TAGGGCGGCGTGACGGCCAGCC AGTGGT 3 ' 

The first primer contained a biotin group at the 5' end, and the recognition site for the 
restriction enzyme EcoR I. The second primer contained the restriction enzyme recognition site for 
BceA I. 

25 PCR Reaction 

All five loci of interest were amplified from the template genomic DNA using PCR (U.S. 
Patent Nos. 4,683, 195 and 4,683,202). For increased specificity, a "hot-start" PCR was used. PCR 
reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
number 203443). The amount of template DNA and primer per reaction can be optimized for each 
30 locus of interest; in this example, 40 ng of template human genomic DNA and 5 ^iM of each primer 
were used. Thirty-eight cycles of PCR were performed. The following PCR conditions were used 
for SNP TSC 01 15603, SNP TSC 0309610, and SNP TSC 02003437: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 42°C for 30 seconds; 
35 (3) 95°C for 30 seconds; 
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(4) 60°Cfor30 seconds; 

(5) 95°Cfor30 seconds; 

(6) 69°C for 30 seconds; 

(7) 95°C for 30 seconds; 



5 



(8) Repeat steps 6 and 7 thirty nine (37) times; 



(9) 72°C for 5 minutes. 
The following PCR conditions were used for SNP ss813773, SNP TSC 0198557, and SNP 
TSC 0197279: 

(1) 95°C for 15 minutes and 15 seconds; 



(8) Repeat steps 6 and 7 thirty nine (37) times; and 

(9) 72°C for 5 minutes. 

In the first cycle of each PCR, the annealing temperature was about the melting temperature 
of the 3 ' annealing region of the second primer. The annealing temperature in the second cycle of 

20 PCR was about the melting temperature of the 3' region, which anneals to the template DNA, of the 
first primer. The annealing temperature in the third cycle of PCR was about the melting 
temperature of the entire sequence of the second primer. Escalating the annealing temperature from 
TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. These annealing 
temperatures are representative, and the skilled artisan will understand the annealing temperatures 

25 for each cycle are dependent on the specific primers used. The temperatures and times for 

denaturing, annealing, and extension, can be optimized by trying various settings and using the 
parameters that yield the best results. 

r 

Purification of Fragment of Interest 

30 PCR products were separated from the components of the PCR reaction using Qiagen's 

MinElute PCR Purification Kit following manufacturer's instructions (Catalog number 28006). 
The PCR products were eluted in 20 |xl of distilled water. For each amplified SNP, one microliter 
of PCR product, 1 nl of amplified internal control DNA (SNP TSC 0200347), and 8 jd of distilled 
water were mixed. Five microliters of each sample was placed into two separate reaction wells of a 

35 Pierce StreptaWell Microtiter plate (catalog number 15501). The first primers contained a 5' biotin 



10 



15 



(2) 37°Cfor30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°Cfor 30 seconds; 

(5) 95°C for 30 seconds; 

(6) 64°C for 30 seconds; 

(7) 95°C for 30 seconds; 
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tag so the PCR products bound to the Streptavidin coated wells while the genomic template DNA 
did not. The streptavidin binding reaction was performed using a Thermomixer (Eppendorf) at 150 
rpm for 1 hour at 45°C. Each well was aspirated to remove unbound material, and washed three 
times with IX PBS, with gentle mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); 
Kaneoka et al., Biotechniques 10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 
(1990)). 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme that bound the 
recognition site that was incorporated into the PCR products from the second primer. The purified 
PCR products were digested with the restriction enzyme BceA I (New England Biolabs, catalog 
number R0623S). The digests were performed in the wells of the microtiter plate following the 
instructions supplied with the restriction enzyme. After digestion with the appropriate restriction 
enzyme, the wells were washed three times with PBS to remove the cleaved fragments. 

Incorporation of Labeled Nucleotide 

The restriction enzyme digest described above yielded a DNA fragment with a 5' overhang, 
which contained the SNP and a 3' recessed end. The 5' overhang functioned as a template allowing 
incorporation of a nucleotide or nucleotides in the presence of a DNA polymerase. 

For each SNP, two fill in reactions were performed; each reaction contained a different 
fluorescently labeled dideoxynucleotide (ddATP, ddCTP, ddGTP, or ddTrf, depending on the 
reported nucleotides to exist at a particular SNP). For example, the nucleotides adenine and 
thymidine have been reported at SNP TSC 01 15603. Therefore, the digested PCR product for SNP 
TSC 01 15603 was mixed with either fluorescently labeled ddATP or fluorescently labeled ddTTP. 
Each reaction contained fluorescently labeled ddGTP for the internal control. The following 
components were added to each fill in reaction: 2 fxl of a ROX-conjugated dideoxynucleotide 
(depending on the nucleotides reported for each SNP), 2 ^1 of ROX-conjugated ddGTP (internal 
control), 2.5 ^il of 10X sequenase buffer, 2 pi of Sequenase, and water as needed for a 25 nl 
reaction. All of the fill in reactions were performed at 45°C for 45 min. However, shorter time 
periods of incorporation can be used. Non-fluorescently labeled ddNTPs were purchased from 
Fermentas Inc. (Hanover, MD). The ROX-conjugated ddNTPs were obtained from Perkin Elmer. 
In the presence of fluorescently labeled ddNTPs, the 3 1 recessed end was extended by one base, 
which corresponds to the SNP or locus of interest. 

After labeling, each Streptawell was rinsed with IX PBS (100 jd) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
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enzyme EcoR I following manufacturer's recommendations. Digestion was performed for 1 hour at 
37°C with shaking at 120 ipm. 

Detection of the Locus of Interest 

5 After release from the streptavidin matrix, 3 of the 10 fxl sample was loaded in a 48 well 

membrane tray (The Gel Company, catalog number TAM48-01). The sample in the tray was 
absorbed with a 48 Flow Membrane Comb (The Gel Company, catalog number AM48), and 
inserted into a 36 cm 5% acrylamide (urea) gel (BioWhittaker Molecular Applications, Long 
Ranger Run Gel Packs, catalog number 50691). 

10 The sample was electrophoresed into the gel at 3000 volts for 3 min. The membrane comb 

was removed, and the gel was run for 3 hours on an ABI 377 Automated Sequencing Machine. The 
incorporated labeled nucleotide was detected by fluorescence. 

As seen in FIG. 12, SNP TSC 0115603 was "filled in" with labeled ddTTP (lane 1) and in 
a separate reaction with labeled ddATP (lane 3). The calculated ratio between the nucleotides, 

15 using the raw data, was 66:34, which is consistent with the theoretical ratio of 66:33 for a SNP on 
chromosome 21 in an individual with trisomy 21. Both the ddTTP and ddATP were labeled with 
the same fluorescent dye to minimize variability in incorporation efficiencies of the dyes. 
However, nucleotides with different fluorescent labels or any detectable label can be used. It is 
preferable to calculate the coefficients of incorporation when different labels are used. 

20 Each fill in reaction was performed in a separate well so it was possible that there could be 

variability in DNA binding between the wells of the microtiter plate. To account for the potential 
variability of DNA binding to the streptavidin-coated plates, an internal control was used. The 
internal control (SNP TSC 0200347), which is homozygous for guanine, was added to the sample 
prior to splitting the sample into two separate wells, and thus, an equal amount of the internal 

25 control should be present in each well. The amount of incorporated ddGTP, can be fixed between 
the two reactions. If the amount of DNA in each well is equal, the amount of incorporated ddGTP 
should be equal because the reaction is performed under saturating conditions, with saturating 
conditions being defined as conditions that support incorporation of a nucleotide at each template 
molecule. Using the internal control, the ratio of incorporated ddATP to ddTTP was 63.4:36.6. 

30 This ratio was very similar to the ratio obtained with the raw data, indicating that there are minor 
differences in the two fill in reactions for a particular SNP. 

Table VI. Allele Frequencies at Multiple SNPs on DNA Template from Individual with 



Trisomy 21 
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SNP TSC 0309610 was filled in with ddTTP (lane 3) or ddCTP (lane 4) (FIG. 12). The 
calculated ratio for the nucleotides, using the raw data, was 64:36. Both ddTTP and ddCTP were 
labeled with the same fluorescent dye. After normalization to the internal control, as discussed 
5 above, the calculated allele ratio of ddTTP to ddCTP was 66.8:33 .2 (Table VI). Again, both the 
calculated ratio from the raw data and the calculated ratio using the internal control are very similar 
to the theoretical ratio of 66.6:33.4 for a SNP on chromosome 21 in an individual with trisomy. 

To demonstrate that the 66:33 ratios for nucleotides at heterozygous SNPS represented loci 
on chromosomes present in three copies, SNPs on chromosome 13 were analyzed. The individual 
10 from whom the blood sample was obtained had previously been genolyped with one maternal 
chromosome 13, and one paternal chromosome 13. 

Submitted SNP (ss) 813773 was filled in with ddATP (lane 5) or ddCTP (lane 6) (FIG. 
12). The calculated ratio for the nucleotides at this heterozygous SNP, using the raw data, was 
46:54. This ratio is within 10% of the expected ratio of 50:50. Importantly, the ratio does not 
15 approach the 66:33 ratio expected when there is an additional copy of a chromosome. 

After normalization to the internal control, the calculated ratio was 41:59. Contrary to the 
expected result, normalization to the internal control increased the discrepancy between the 
calculated ratio and the theoretical ratio. This result may represent experimental error that occurred 
in aliquoting the DNA samples. 
20 Also, it is possible that the restriction enzyme used to generate the overhang, which was 

used as a template for the "fill-in" reaction, preferentially cut one DNA template over the other 
DNA template. The two templates differ, with respect to the nucleotide at the SNP site, and this 
may influence the cutting. The primers can be designed such that the nucleotides adjacent to the cut 
site are the same, independent of the nucleotide at the SNP site (discussed further in the section 
25 entitled "Primer Design"). 

SNP TSC 0198557, which is on chromosome 13, was filled in with ddTTP (tone 7) ^ one 
reaction and ddCTP (lane 8) in another (FIG. 12). The calculated ratio for the nucleotides at this 
SNP, using the raw data, was 55:45. After normalization to the internal control, the calculated 

120 



WO 2004/078994 



PCT/US2004/006337 



allele ratio of T:C was 49:51 . The normalized ratio was closer to the theoretical ratio of 50:50 for 
an individual with two copies of chromosome 13. 

SNP TSC 0197279, which is on chromosome 13, was filled in with ddTTP (lane 9) in one 
reaction and ddCTP (lane 10) in another (FIG. 12). The calculated ratio for the nucleotides at this 
SNP, using the raw data, was 53:47. After normalization to the internal control, the calculated 
allele ratio of T:C was 50.7:49.3. This is consistent with the theoretical ratio of 50:50 for an 
individual with only two copies of chromosome 13. 

The ratio for the nucleotides at two of the analyzed SNPs on chromosome 13 was 
approximately 50:50. One SNP, ss813773, showed a ratio of 46:54, and when normalized to the 
internal control, the ratio was 41:59. These ratios deviate from the expected 50:50, but at the same 
time, the ratios are not indicative of an extra chromosome, which is indicated with a ratio of 66:33. 
While the data from this particular SNP is inconclusive, it does not represent a false positive. No 
conclusion could be drawn on the data from this SNP. However, the other two SNPs provided data 
that indicated a normal number of chromosomes. It is preferable to analyze multiple SNPs on a 
chromosome including but not limited to 1-5, 5-10, 10-50, 50-100, 100-200, 200-300, 300-400, 
400-500, 500-600, 600-700, 700-800, 800-900, 900-1000, 1000-2000, 2000-3000, and greater than 
3000. Preferably, the average of the ratios for a particular chromosome will be used to determine 
the presence or absence of a chromosomal abnormality. However, it is still possible to analyze one 
locus of interest. In the event that inconclusive data is obtained, another locus of interest can be 
analyzed. 

The individual from whom the DNA template was obtained had previously been genotyped 
with trisomy 21, and the allele frequencies at SNPs on chromosome 21 indicate the presence of an 
additional chromosome 21. The additional chromosome contributes an additional nucleotide for 
each SNP, and thus alters the traditional 50:50 ratio at a heterozygous SNP. These results are 
consistent for multiple SNPs, and are specific for those found on chromosome 2 1 . The allele 
frequencies for SNPs on chromosome 13 gave the expected ratios of approximately 50:50. These 
results demonstrate that this method of SNP detection can be used to detect chromosomal 
abnormalities including but not limited to translocations, transversions, monosomies, trisomy 21, 
trisomy 18, trisomy 13, other aneuplodies, deletions, additions, amplifications, translocations and 
rearrangements. 

EXAMPLE 6 

Genomic DNA was obtained from four individuals after informed consent was obtained. 
Six SNPs on chromosome 13 (TSC0837969, TSC0034767, TSC 1 130902, TSC0597888, 
TSC0195492, TSC0607185) were analyzed using the template DNA. Information regarding these 
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SNPs can be found at the following website www.snp.chsl.org/snpsearch.shtml; website active as of 
February 11,2003). 

A single nucleotide labeled with one fluorescent dye was used to genotype the individuals 
at the six selected SNP sites. The primers were designed to allow the six SNPs to be analyzed in a 
5 single reaction. 

Preparation of Template DNA 

The template DNA was prepared from a 9 ml sample of blood obtained by venipuncture 
from a human volunteer with informed consent. Template DNA was isolated using the QIAmp 
10 DNA Blood Midi Kit supplied by QIAGEN (Catalog number 5 1 1 83). The template DNA was 
isolated as per instructions included in the kit. 

Design of Primers 

SNP TSC0837969 was amplified using the following primer set: 

15 

First primer: 

5' GGGCTAGTCTCCGAATTCCACCTATCCTACCAAATGTC 3' 
20 Second primer: 

5' TAGCTGTAGTTAGGGACTGTTCTGAGCAC 3' 

25 The first primer had a biotin tag at the 5' end and contained a restriction enzyme recognition 

site for EcoRI. The first primer was designed to anneal 44 bases from the locus of interest. The 
second primer contained a restriction enzyme recognition site for BsmF I. 

30 SNP TSC0034767 was amplified using the following primer set: 

First primer: 

5' CGAATGCAAGGCGAATTCGTTAGTAATAACACAGTGCA 3' 

35 
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Second primer: 

5' AAGACTGGATCCGGGACCATGTAGAATAC 3 9 

The first primer had a biotin tag at the 5* end and contained a restriction enzyme recognition 
site for EcoRI. The first primer was designed to anneal 50 bases from the locus of interest. The 
second primer contained a restriction enzyme recognition site for BsmF L 

SNP TSC1 130902 was amplified using the following primer set: 

First primer: 

5' TCTAACCATTGCGAATTCAGGGCAAGGGGGGTGAGATC 3' 
Second primer: 

5' TGACTTGGATCCGGGACAACGACTCATCC 3' 

The first primer had a biotin tag at the 5 ! end and contained a restriction enzyme recognition 
site for EcoRI. The first primer was designed to anneal 60 bases from the locus of interest. The 
second primer contained a restriction enzyme recognition site for BsmF L 

SNP TSC0597888 was amplified using the following primer set: 

First primer: 

5' ACCCAGGCGCCAGAATTCTTTAGATAAAGCTGAAGGGA 3' 
Second primer: 

5' GTTACGGGATCCGGGACTCCATATTGATC 3' 

The first primer had a biotin tag at the 5' end and contained a restriction enzyme recognition 
site for EcoRI. The first primer was designed to anneal 70 bases from the locus of interest. The 
second primer contained a restriction enzyme recognition site for BsmF I. 
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SNP TSC0195492 was amplified using the following primer set: 
5 First primer: 

5 'CGTTGGCTTGAGGAATTCGACCAAAAGAGCCAAGAGAA 

10 Second primer: 

5' AAAAAGGGATCCGGGACCTTGACTAGGAC 3' 

The first primer had a biotin tag at the 5' end and contained a restriction enzyme recognition 
15 site for EcoRI. The first primer was designed to anneal 80 bases from the locus of interest. The 
second primer contained a restriction enzyme recognition site for BsmF I. 

SNP TSC0607185 was amplified using the following primer set: 

20 First primer: 

5' ACTTGATTCCGTGAATTCGTTATCAATAAATCTTACAT 3' 

Second primer: 

25 

5 s CAAGTTGGATCCGGGACCCAGGGCTAACC 3' 

The first primer had a biotin tag at the 5 f end and contained a restriction enzyme recognition 
site for EcoRI. The first primer was designed to anneal 90 bases from the locus of interest. The 
30 second primer contained a restriction enzyme recognition site for BsmF I. 

All loci of interest were amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
In this example, the loci of interest were amplified in separate reaction tubes but they could also be 
amplified together in a single PCR reaction. For increased specificity, a "hot-start" PCR was used. 
35 PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
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number 203443). The amount of template DNA and primer per reaction can be optimized for each 
locus of interest but in this example, 40 ng of template human genomic DNA and 5 nM of each 
primer were used. Forty cycles of PCR were performed. The following PCR conditions were used: 
(1) 95°C for 15 minutes and 15 seconds; 
5 (2) 37°C for 30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°Cfor30 seconds; 

(6) 64°Cfor30 seconds; 
10 (7) 95°C for 30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature was about the melting temperature of 
the 3 ' annealing region of the second primers, which was 37°C. The annealing temperature in the 

1 5 second cycle of PCR was about the melting temperature of the 3 1 region, which anneals to the 

template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 

20 These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. In this example, the first 
primer was designed to anneal at various distances from the locus of interest. The skilled artisan 

25 understands that the annealing location of the first primer can be 5-10, 1 1-15, 16-20, 21-25, 26-30, 
31-35, 36-40, 41-45, 46-50, 51-55, 56-60, 61-65, 66-70, 71-75, 76-80, 81-85, 86-90, 91-95, 96-100, 
101-105, 106-110, 111-115, 116-120, 121-125, 126-130, 131-140, 1410-160, 1610-180, 1810-200, 
2010-220, 2210-240, 2410-260,. 2610-280,. 2810-300, 3010-350, 3510-400, 4010-450, 450-500, or 
greater than 500 bases from the locus of interest 

30 Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. After the PCR 
reaction, 1//4 of the volume of each PCR reaction from one individual was mixed together in a well 
of a Streptawell, transparent, High-Bind plate from Roche Diagnostics GmbH (catalog number 1 
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645 692, as listed in Roche Molecular Biochemicals, 2001 Biochemicals Catalog). The first 
primers contained a 5' biotin tag so the PCR products bound to the Streptavidin coated wells while 
the genomic template DNA did not. The streptavidin binding reaction was performed using a 
Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each well was aspirated to remove 
5 unbound material, and washed three times with IX PBS, with gentle mixing (Kandpal et aL, Nucl. 
Acids Res. 18:1789-1795 (1990); Kaneoka et aL, Biotechniques 10:30-34 (1991); Green et aL, 
Nucl. Acids Res. 18:6163-6164 (1990)). 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
10 to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with {he restriction enzyme. 
After digestion, the wells were washed three times with PBS to remove the cleaved fragments. 
Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5 f overhang, 
15 which contained the SNP site or locus of interest and a 3* recessed end. The 5 1 overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

Below, a schematic of the 5* overhang for SNP TSC0837969 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (where R indicates the 
20 . variable site). 

5'TTAA 

3' AATT R A C A 
Overhang position 12 3 4 

25 

The observed nucleotides for TSC0837969 on the 5' sense strand (here depicted as the top 
strand) are adenine and guanine. The third position in the overhang on the ahtisense strand 
corresponds to cytosine, which is complementary to guanine. As this variable site can be adenine 
or guanine, fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP was 
30 used to determine the sequence of both alleles. The fill-in reactions for an individual homozygous 
for guanine, homozygous for adenine or heterozygous are diagrammed below. 

Homozygous for guanine at TSC 0837969: 
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Allele 1 5' TTAA G* 

3'AATT C A C A 
Overhang position 12 3 4 



Allele 2 5' TTAA G* 

3'AATT C A C A 
Overhang position 12 3 4 



10 Labeled ddGTP is incorporated into the first position of the overhang. Only one signal is 

seen, which corresponds to the molecules filled in with labeled ddGTP at the first position of the 
overhang. 



15 



Homozygous for adenine at TSC 0837969: 

Allele 1 5* TTAA A T G* 

3 5 AATT T A C A 
Overhang position 1 2 3 4 



20 Allele 2 5' TTAA A T G* 

3'AATT T A C A 
Overhang position 12 3 4 

Unlabeled dATP is incorporated at position one of the overhang, and unlabeled dTTP is 
25 incorporated at position two of the overhang. Labeled ddGTP was incorporated at position three of 
the overhang. Only one signal will be seen; the molecules filled in with ddGTP at position 3 will 
have a different molecular weight from molecules filled in at position one, which allows easy 
identification of individuals homozygous for adenine or guanine. 

30 Heterozygous at TSC0837969: 

Allele 1 5' TTAA G* 

3'AATT C A C A 



Overhang position 12 3 4 



35 
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Allele 2 5' TTAA A T G* 

3'AATT T A C A 
Overhang position 12 3 4 

Two signals will be seen; one signal corresponds to the DNA molecules filled in with 
ddGTP at position 1, and a second signal corresponding to molecules filled in at position 3 of the 
overhang. The two signals can be separated using any technique that separates based on molecular 
weight including but not limited to gel electrophoresis. 

Below, a schematic of the 5' overhang for SNP TSC0034767 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (where R indicates the 
variable site). 

A C A R GTGT3* 

CACA 5' 

4 3 2 1 Overhang Position 

The observed nucleotides for TSC0034767 on the 5' sense strand (here depicted as the top 
strand) are cytosine and guanine. The second position in the overhang corresponds to adenine, 
which is complementary to thymidine. The third position in the overhang corresponds to cytosine, 
which is complementary to guanine. Fluorescently labeled ddGTP in the presence of unlabeled 
dCTP, dTTP, and dATP is used to determine the sequence of both alleles. 

In this case, the second primer anneals upstream of the locus of interest, and thus the fill-in 
reaction occurs on the anti-sense strand (here depicted as the bottom strand). Either the sense strand 
or the antisense strand can be filled in depending on whether the second primer, which contains the 
type IIS restriction enzyme recognition site, anneals upstream or downstream of the locus of 
interest. 

Below, a schematic of the 5' overhang for SNP TSC1 130902 is shown. The entire DNA 
sequence is not reproduced, only a portion to demonstrate the overhang (where R indicates the 
variable site). 

5'TTCAT 

3'AAGTA R T C C 
Overhang position 12 3 4 
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The observed nucleotides for TSC1 130902 on the 5' sense strand (here depicted as the top 
strand) are adenine and guanine. The second position in the overhang corresponds to a thymidine, 
and the third position in the overhang corresponds to cytosine, which is complementary to guanine. 

Fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP is used 
to determine the sequence of both alleles. 

Below, a schematic of the 5' overhang for SNP TSC0597888 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (where R indicates the 
variable site). 

T C T R ATTC3* 

TAAG5' 

4 3 2 1 Overhang position 

The observed nucleotides for TSC0597888 on the 5' sense strand (here depicted as the top 
strand) are cytosine and guanine. The third position in the overhang corresponds to cytosine, which 
is complementary to guanine. Fluorescently labeled ddGTP in the presence of unlabeled dCTP, 
dTTP, and dATP is used to determine the sequence of both alleles. 

Below, a schematic of the 5' overhang for SNP TSC0607185 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (where R indicates the 
variable site). 

C C T R TGTC3' 

ACAG 5' 

4 3 2 1 Overhang position? 

The observed nucleotides for TSC0607185 on the 5' sense strand (here depicted as the top 
strand) are cytosine and thymidine. In this case, the second primer anneals from the locus of 
interest, which allows the anti-sense strand to be filled in. The anti-sense strand (here depicted as 
the bottom strand) will be filled in with guanine or adenine. 

The second position in the 5' overhang is thymidine, which is complementary to adenine, 
and the third position in the overhang corresponds to cytosine, which is complementary to guanine. 
Fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP is used to 
determine the sequence of both alleles. 

Below, a schematic of the 5' overhang for SNP TSC0195492 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang. 
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5' ATCT 

3' TAG A R A C A 
Overhang position 12 3 4 

5 

The observed nucleotides at this site are cytosine and guanine (here depicted as the top 
strand) . The second position in the 5' overhang is adenine, which is complementary to thymidine, 
and the third position in the overhang corresponds to cytosine, which is complementary to guanine. 
Fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP is used to 

10 determine the sequence of both alleles. 

As demonstrated above, the sequence of both alleles of the six SNPs can be determined by 
labeling with ddGTP in the presence of unlabeled dATP, dTTP, and dCTP. The following 
components were added to each fill in reaction: 1 ill of fluorescently labeled ddGTP, 0.5 \il of 
unlabeled dNTPs ( 40 \M) 9 which contained all nucleotides except guanine, 2 \x\ of 10X sequenase 

15 buffer, 0.25 \il of Sequenase, and water as needed for a 20^1 reaction. The fill in reaction was 
performed at 40°C for 10 min. Non-fluorescently labeled dNTP was purchased from Fermentas 
Inc. (Hanover, MD). All other labeling reagents were obtained from Amersham (Thermo 
Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 \xl) three times. The "filled 

20 in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 
Detection of the Locus of Interest 

After release from the streptavidin matrix, the sample was loaded into a lane of a 36 cm 5% 

25 acrylamide (urea) gel (BioWhittaker Molecular Applications, Long Ranger Run Gel Packs, catalog 
number 50691). The sample was electrophoresed into the gel at 3000 volts for 3 min. The gel was 
run for 3 hours on a sequencing apparatus (Hoefer SQ3 Sequencer). The gel was removed from the 
apparatus and scanned on the Typhoon 9400 Variable Mode Imager. The incorporated labeled 
nucleotide was detected by fluorescence. 

30 As shown in FIG. 1 1, the template DNA in lanes 1 and 2 for SNP TSC0837969 is 

homozygous for adenine. The following fill-in reaction was expected to occur if the individual was 
homozygous for adenine: 

Homozygous for adenine at TSC 0837969: 

35 
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5' TTAA A T G* 
3'AATT T A C A 
Overhang position 12 3 4 



5 Unlabeled dATP was incorporated in the first position complementary to the overhang. 

Unlabeled dTTP was incorporated in the second position complementary to the overhang. Labeled 
ddGTP was incorporated in the third position complementary to the overhang. Only one band was 
seen, which migrated at about position 46 of the acrylamide gel. This indicated that adenine was 
the nucleotide filled in at position one. If the nucleotide guanine had been fijled in, a band would 
10 be expected at position 44. 

However, the template DNA in lanes 3 and 4 for SNP TSC0837969 was heterozygous. The 
following fill-in reactions were expected if the individual was heterozygous: 



15 



Heterozygous at TSC0837969: 

Allele 15' TTAA G* 

3'AATT C A C A 
Overhangposition 12 3 4 



20 Allele 25' TTAA A T G* 

3'AATT T A C A 
Overhangposition 12 3 4 



Two distinct bands were seen; one band corresponds to the molecules filled in with ddGTP 
25 at position 1 complementary to the overhang (the G allele), and the second bhnd corresponds to 
molecules filled in with ddGTP at position 3 complementary to the overhang (the A allele). The 
two bands were separated based on the differences in molecular weight using gel electrophoresis. 
One fluorescently labeled nucleotide ddGTP was used to determine that an individual was 
heterozygous at a SNP site. This is the first use of a single nucleotide to effectively detect the 
30 presence of two different alleles. 

For SNP TSC0034767, the template DNA in lanes 1 and 3 is heterozygous for cytosine and 
guanine, as evidenced by the two distinct bands. The lower band corresponded to ddGTP filled in 
at position 1 complementary to the overhang. The second band of slightly higher molecular weight 
corresponded to ddGTP filled in at position 3, indicating that the first position in the overhang was 
35 filled in with unlabeled dCTP, which allowed the polymerase to continue to incorporate nucleotides 
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until it incorporated ddGTP at position 3 complementary to the overhang. The template DNA in 
lanes 2 and 4 was homozygous for guanine, as evidenced by a single band of higher molecular 
weight than if ddGTP had been filled in at the first position complementary to the overhang. 

For SNP TSC1 130902, the template DNA in lanes 1, 2, and 4 is homozygous for adenine at 

5 the variable site, as evidenced by a single higher molecular weight band migrating at about position 
62 on the gel. The template DNA in lane 3 is heterozygous at the variable site, as indicated by the 
presence of two distinct bands. The lower band corresponds to molecules filled in with ddGTP at 
position 1 complementary to the overhang (the guanine allele). The higher molecular weight band 
corresponds to molecules filled in with ddGTP at position 3 complementary to the overhang (the 

10 adenine allele). 

For SNP TSC0597888, the template DNA in lanes 1 and 4 was homozygous for cytosine at 
the variable site; the template DNA in lane 2 was heterozygous at the variable site, and the template 
DNA in lane 3 was homozygous for guanine. The expected fill-in reactions are diagrammed below: 



1 5 Homozygous for cytosine: 

Allele 1 T C T G ATTC3' 

G* A C TAAG5' 

4 3 2 1 Overhang position 



20 Allele 2 T C T G ATTC3' , 

G* A C TAAG5' 
4 3 2 1 Overhang position 



Homozygous for guanine: 

25 Allele 1 T C T C ATTC3' 

G* TAAG5' 

4 3 2 1 Overhang position 



Allele 2 T C T C ATTC3' 

30 G* TAAG5' 

4 3 2 1 Overhang position 



35 



Heterozygous for guanine/cytosine: 

Allele 1 T C T G ATTC3' 

G* A C TAAG5' 
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4 



3 



2 



1 



Overhang position 



Allele 2 



T 



C 



T 



C 



ATTC3' 



G* 



TAAG5' 



4 



3 



2 



1 



Overhang position 



Template DNA homozygous for guanine at the variable site displayed a single band, which 
corresponded to the DNA molecules filled in with ddGTP at position 1 complementary to the 
overhang. These DNA molecules were of lower molecular weight compared to the DNA molecules 
filled in with ddGTP at position 3 of the overhang (see lane 3 for SNP TSC0597888). The DNA 
molecules differed by two bases in molecular weight. 

Template DNA homozygous for cytosine at the variable site displayed a single band, which 
corresponds to the DNA molecules filled in with ddGTP at position 3 complementary to the 
overhang. These DNA molecules migrated at a higher molecular weight than DNA molecules filled 
in with ddGTP at position 1 (see lanes 1 and 4 for SNP TSC0597888). 

Template DNA heterozygous at the variable site displayed two bands; one band 
corresponded to the DNA molecules filled in with ddGTP at position 1 complementary to the 
overhang and was of lower molecular weight, and die second band corresponded to DNA molecules 
filled in with ddGTP at position 3 complementary to die overhang, and was of higher molecular 
weight (see lane 3 for SNP TSC0597888). 

For SNP TSC0195492, the template DNA in lanes 1 and 3 was heterozygous at the variable 
site, which was demonstrated by the presence of two distinct bands. The template DNA in lane 2 
was homozygous for guanine at the variable site. The template DNA in lane 4 was homozygous for 
cytosine. Only one band was seen in lane 4 for this SNP, and it had a higher molecular weight than 
the DNA molecules filled in with ddGTP at position 1 complementary to the overhang (compare 
lanes 2, 3 and 4). 

The observed alleles for SNP TSC0607185 are reported as cytosine or thymidine. For 
consistency, the SNP consortium denotes the observed alleles as they appear in the sense strand 
www.snp.cshl.org/shpsearch.shtml; website active as of February 1 1, 2003). For this SNP, the 
second primer annealed upstream of the locus of interest, which allowed the fill-in reaction to occur 
on the antisense strand after digestion with BsmF I. 

The template DNA in lanes 1 and 3 was heterozygous; the template DNA in lane 2 was 

t 

homozygous for thymidine, and the template DNA in lane 4 was homozygous for cytosine. The 
antisense strand was filled in with ddGTP, so the nucleotide on the sense strand corresponded to 
cytosine. 
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Molecular weight markers can be used to identify the positions of the expected bands. 
Alternatively, for each SNP analyzed, a known heterozygous sample can be used, which will 
identify precisely the position of the two expected bands. 

As demonstrated in FIG. 1 1, one nucleotide labeled with one fluorescent dye can be used to 
determine the identity of a variable site including but not limited to SNPs and single nucleotide 
mutations. Typically, to determine if an individual is homozygous or heterozygous at a SNP site, 
multiple reactions are performed using one nucleotide labeled with one dye and a second nucleotide 
labeled with a second dye. However, this introduces problems in comparing results because the 
two dyes have different quantum coefficients. Even if different nucleotides are labeled with the 
same dye, the quantum coefficients are different. The use of a single nucleotide labeled with one 
dye eliminates any errors from the quantum coefficients of different dyes. 

In this example, fluorescently labeled ddGTP was used. However, the method is applicable 
for a nucleotide tagged with any signal generating moiety including but not limited to radioactive 
molecule, fluorescent molecule, antibody, antibody fragment, hapten, carbohydrate, biotin, 
derivative of biotin, phosphorescent moiety, luminescent moiety, electrochemiluminescent moiety, 
chromatic moiety, and moiety having a detectable electron spin resonance, electrical capacitance, 
dielectric constant or electrical conductivity. In addition, labeled ddATP, ddTTP, or ddCTP can be 
used. 

The above example used the third position complementary to the overhang as an indicator 
of the second allele. However, the second or fourth position of the overhang can be used as well 
(see Section on Incorporation of Nucleotides). Furthermore, the overhang was generated with the 
type IIS enzyme BsmF I; however any enzymes that cuts DNA at a distance from its binding site 
can be used including but not limited to the enzymes listed in Table I. 

Also, in the above example, the nucleotide immediately preceding the SNP site was not a 
guanine on the strand that was filled in. This eliminated any effects of the alternative cutting 
properties of the type IIS restriction enzyme to be removed. For example, at SNP TSC0837969, the 
nucleotide from the SNP site on the sense strand was an adenine. If BsmF I displayed alternate 
cutting properties, the following overhangs would be generated for the adenine allele and the 
guanine allele: 



G allele- 11/15 Cut 



5'TTA 



3* AAT 



T 



C 



A 



C 



Overhang position 



0 



1 



2 



3 
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G allele after fill-in 5'TTA A G* 

3'AAT T C A C 

Overhang position 0 12 

A allele 11/15 Cut 5'TTA 

3' AAT T T A C 

Overhang position 0 12 3 



10 A allele after fill-in 5'TTA A A T G* 

3'AAT T T A C 

Overhang position 0 12 3 

For the guanine allele, the first position in the overhang would be filled in with dATP, 
1 5 which would allow the polymerase to incorporate ddGTP at position 2 complementary to the 

overhang. There would be no detectable difference between molecules cut at the 10/14 position or 

molecules cut at the 1 1/15 position. 

For the adenine allele, the first position complementary to the overhang would be filled in 

with dATP, the second position would be filled in with dATP, the third position would be filled in 
20 with dTTP, and the fourth position would be filled in with ddGTP. There would be no difference in 

the molecular weights between molecules cut at 10/14 or molecules cut at 11/15. The only 

differences would correspond to whether the DNA molecules contained an adenine at the variable 

site or a guanine at the variable site. 

As seen in FIG. 1 1, positioning the annealing region of the first primer allows multiple 
25 SNPs to be analyzed in a single lane of a gel. Also, when using the same nucleotide with the same 

dye, a single fill-in reaction can be performed. In this example, 6 SNPs were analyzed in one lane. 

However, any number of SNPs including but not limited to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 

14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 30-40, 40-50, 51-60, 61-70, 71-80, 

81-100, 101-120, 121-140, 141-160, 161-180, 181-200, and greater than 200 can be analyzed in a 
30 single reaction. 

Furthermore, one labeled nucleotide used to detect both alleles can be mixed with a second 
labeled nucleotide used to detect a different set of SNPs provided that neither of the nucleotides that 
are labeled occur immediately before the variable site (complementary to nucleotide at position 0 of 
the 1 1/15 cut). For example, suppose SNP X can be guanine or thymidine at the variable site and 
35 has the following 5 ' overhang generated after digestion with BsmF I: 
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SNP X 10/14 5' TTGAC 

GaUele 3'AACTG C A C T 

Overhang position 12 3 4 

SNP X 11/15 5'TTGA 

GaUele 3'AACT G . C A C 

t 

Overhang position 0 12 3 



10 



SNP X 10/14 5' TTGAC 

T allele 3'AACTG A A C T 

Overhang position 12 3 4 



15 



SNP X 11/15 5'TTGA 

T allele 3'AACT G A A C 

Overhang position 0 12 3 



20 



After the fill-in reaction with labeled ddGTP, unlabeled dATP, dCTP, and dTTP, the 
following molecules would be generated: 

SNP X 10/14 5' TTGAC G* 

GaUele 3'AACTG C A C T 

Overhang position 1 2 3 4, 



25 



SNP X 11/15 5'TTGA C G* 

GaUele 3'AACT G C A C 

Overhang position 0 12 3 



30 



SNP X 10/14 
T allele 

Overhang position 



5' TTGAC 
3'AACTG 



T 

A 
1 



T 
A 
2 



G* 

C 

3 



T 
4 



35 



SNP X 11/15 
T allele 

Overhang position 



5'TTGA 
3'AACT 



C 
G 
0 



T 
A 
1 



T 
A 
2 



G* 

C 
3 
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Now suppose SNP Y can be adenine or thymidine at the variable she, and has the following 
5' overhangs generated after digestion with BsmF L 



SNP Y 10/14 
A allele 

Overhang position 



5* GTTT 
3'CAAA 



T 
1 



G 
2 



T 
3 



A 
4 



10 



SNP Y 11/15 5'GTT 

A allele 3' CAA A T G T 

Overhang position 0 12 3 



15 



SNP Y 10/14 
T allele 

Overhang position 



5'GTTT 
3'CAAA 



A 
1 



G 
2 



T 
3 



A 

4 ' 



20 



SNP Y 11/15 5'GTT 

Tallele 3' CAA A A G T 

Overhang position 0 12 3 



After fill-in with labeled ddATP and unlabeled dCTP, dGTP, and dTTP, the following 
molecules would be generated: 



25 



SNP Y 10/14 
A allele 

Overhang position 



5' GTTT 
3'CAAA 



A* 

T 

1 



G 

2. 



T 

3 



A 
4 



30 



SNP Y 11/15 5'GTT T A* 

Aallele 3' CAA A T G T 

Overhang position 0 12 3 



35 



SNP Y 10/14 
Tallele 

Overhang position 



5' GTTT 
3'CAAA 



T 
A 
1 



C 
G 
2 



A* 

T 

3 



A 
4 
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SNPY 11/15 5'GTT T T C A* 

T allele 3'CAA A A G T 

Overhang position 0 12 3 

5 

In this example, labeled ddGTP and labeled ddATP are used to determine the identity of 
both alleles of SNP X and SNP Y respectively. The nucleotide immediately preceding (the 
complementary nucleotide to position 0 of the overhang from the 1 1/15 cut SNP X is not guanine 
or adenine on the strand that is filled-in. Likewise, the nucleotide immediately preceding SNPY is 

10 not guanine or adenine on the strand that is filled-in. This allows the fill-in reaction for both SNPs 
to occur in a single reaction with labeled ddGTP, labeled ddATP, and unlabeled dCTP and dTTP. 
This reduces the number of reactions that need to be performed and increases the number of SNPs 
that can be analyzed in one reaction. 

The first primers for each SNP can be designed to anneal at different distances from the 

1 5 locus of interest, which allows the SNPs to migrate at different positions on the gel. For example, 
the first primer used to amplify SNP X can anneal at 30 bases from the locus of interest, and the 
first primer used to amplify SNP Y can anneal at 35 bases from the locus of interest. Also, the 
nucleotides can be labeled with fluorescent dyes that emit at spectrums that do not overlap. After 
running the gel, the gel can be scanned at one wavelength specific for one dye. Only those 

20 molecules labeled with that dye will emit a signal. The gel then can be scanned at the wavelength 
for the second dye. Only those molecules labeled with that dye will emit a signal. This method 
allows maximum compression for the number of SNPs that can be analyzed in a single reaction. 

In this example, the nucleotide preceding the variable site on the strand that was filled-in 
was not adenine or guanine, and the nucleotide following the variable site can not be adenine or 

25 guanine on the sense strand. This method can work with any combination of labeled nucleotides, 
and the skilled artisan would understand which labeling reactions can be mixed and those that can 
not. For instance, if one SNP is labeled with thymidine and a second SNP is labeled with cytosine, 
the SNPs can be labeled in a single reaction if the nucleotide immediately preceding each variable 
site is not thymidine or cytosine on the sense strand and the nucleotide immediately after the 

30 variable site is not thymidine or cytosine on the sense strand. 

This method allows the signals from one allele to be compared to the signal from a second 
allele without the added complexity of determining the degree of alternate cutting, or having to 
correct for the quantum coefficients of the dyes. This method is especially useful when trying to 
quantitate a ratio for one allele to another. For example, this method is useful for detecting 

35 chromosomal abnormalities. The ratio of alleles at a hetero2ygous site is expected to be about 1 : 1 
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(one A allele and one G allele). However, if an extra chromosome is present the ratio is expected to 
be about 1 :2 (one A allele and 2 G alleles or 2 A alleles and 1 G allele). This method is especially 
useful when trying to detect fetal DNA in the presence of maternal DNA. 

In addition, this method is useful for detecting two genetic signals in one sample. For 
5 example, this method can detect mutant cells in the presence of wild type cells (see Example 5). If 
a mutant cell contains a mutation in the DNA sequence of a particular gene, this method can be 
used to detect both the mutant signal and the wild type signal. This method can be used to detect ' 
the mutant DNA sequence in the presence of the wild type DNA sequence. The ratio of mutant 
DNA to wild type DNA can be quantitated because a single nucleotide labeled with one signal 
1 0 generating moiety is used. 

EXAMPLE 7 

Non-invasive methods for the detection of various types of cancer have the potential to 
15 reduce morbidity and mortality from the disease. Several techniques for the early detection of 
colorectal tumors have been developed including colonoscopy, barium enemas, and sigmoidoscopy; 
however the techniques are limited in use because they are invasive, which causes a low rate of 
patient compliance. Non-invasive genetic tests may be useful in identifying early stage colorectal 
tumors. 

20 In 1991, researchers identified the Adenomatous Polyposis Coli gene (APC), which plays a 

critical role in the formation of colorectal tumors (Kinzler et aL 9 Science 253:661-665, 1991). The 
APC gene resides on chromosome 5q21-22 and a total of 15 exons code for an RNA molecule of 
8529 nucleotides, which produces a 300 Kd APC protein. The protein is expressed in numerous 
cell types and is essential for cell adhesion. 

25 Mutations in the APC gene generally initiate colorectal neoplasia (Tsao, J. et al 9 Am, J. 

Pathol. 145:531-534, 1994). Approximately 95% of the mutations in the APC gene result in 
nonsense/frameshift mutations. The most common mutations occur at codons 1061 and 1309; 
mutations at these codons account for 1/3 of all germline mutations. With regard to somatic 
mutations, 60%, occur within codons 1286-1513, which is about 10% of the coding sequence. This 

30 region is termed the mutation Cluster Region (MCR). Numerous types of mutations have been 
identified in the APC gene including nucleotide substitutions (see Table VII ), splicing errors (see 
Table VIII), small deletions (see Table DC), small insertions (see Table X), small 
insertions/deletions (see Table XI), gross deletions (see Table XII), gross insertions (see Table 
XIII), and complex rearrangements (see Table XIV). 
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Researchers have attempted to identify cells harboring mutations in the APC gene in stool 
samples (Traverso, G. et al, New England Journal of Medicine, Vol 346:311-320, 2002), While 
APC mutations are found in nearly ail tumors, about 1 in 250 cells in the stool sample has a 
mutation in the APC gene; most of the cells are normal cells that have been shed into the feces. 
5 Furthermore, human DNA represents about one-billionth of the total DNA found in stool samples; 
the majority of DNA is bacterial. The technique employed by Traverso et al. only detects 
mutations that result in a truncated protein. 

As discussed above, numerous mutations in the APC gene have been implicated in the 
formation of colorectal tumors. Thus, a need still exists for a highly sensitive, non-invasive 
10 technique for the detection of colorectal tumors. Below, methods are described for detection of two 
mutations in the APC gene. However, any number of mutations can be analyzed using the methods 
described herein. t 

Preparation of Template DNA 

1 5 The template DNA is purified from a sample containing colon cells including but not 

limited to a stool sample. The template DNA is purified using the procedures described by 
Ahlquist et al. (Gastroenterology, 1 19:1219-1227, 2000). If stool samples are frozen, the samples 
are thawed at room temperature, and homogenized with an Exactor stool shaker (Exact 
Laboratories, Maynard, Mass.) Following homogenization, a 4 gram stool equivalent of each 

20 sample is centrifuged at 2536 x g for 5 minutes. The samples are centrifuged a second time at 16, 
500 x g for 10 minutes. Superaatants are incubated with 20 \il of RNase (0.5 mg per milliliter) for 1 
hour at 37°C. DNA is precipitated with 1/10 volume of 3 mol of sodium acetate per liter and an 
equal volume of isopropanol. The DNA is dissolved in 5 ml of TRIS-EDTA (0.01 mol of Tris per 
liter (pH 7.4) and 0.001 mole of EDTA per liter. 

25 

Design of Primers 

To determine if a mutation resides at codon 1370, the following primers are used: 
First primer: 

5' GTGCAAAGGCCTGAATTCCCAGGCACAAAGCTGTTGAA 3' 

30 

Second primer: 

5' TGAAGCGAACTAGGGACTCAGGTGGACTT 
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The first primer contains a biotin tag at the extreme 5' end, and the nucleotide sequence for 
the restriction enzyme EcoRI. The second primer contains the nucleotide sequence for the 
restriction enzyme BsmF I. 

■ To determine if a small deletion exists at codon 1302, the following primers are used: 
5 First primer: 

5' GATTCCGTAAACGAATTCAGTTCATTATCATCTTTGTC 3' 

Second primer: 

5' CCATTGTTAAGCGGGACTTCTGCTATTTG 3 * 

10 

The first primer has a biotin tag at the 5' end and contains a restriction enzyme recognition 
site for EcoRI. The second primer contains a restriction enzyme recognition site for BsmF I. 

PCR Reaction 

15 The loci of interest are amplified from the template genomic DNA using the polymerase 

chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
The loci of interest are amplified in separate reaction tubes; they can also be amplified together in a 
single PCR reaction. For increased specificity, a "hot-start" PCR reaction is used, e.g. by using the 
HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount of 

20 template DNA and primer per reaction are optimized for each locus of interest but in this example, 
40 ng of template human genomic DNA and 5 ^M of each primer are used. Forty cycles of PCR 
are performed. The following PCR conditions are used: 

(1) 95°C for 15 minutes and 15 seconds; 
25 (2) 37°C for 30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°Cfor30 seconds; 

(5) 95°C for 30 seconds; 

(6) 64°C for 30 seconds; 
30 (7) 95°C for 30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature is about the melting temperature of the 
35 3' annealing region of the second primers, which is 37°C. The annealing temperature in the second 
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cycle of PCR is about the melting temperature of the 3 1 region, which anneals to the template DNA, 
of the first primer, which is 57°C. The annealing temperature in the third cycle of PCR is about the 
melting temperature of the entire sequence of the second primer, which is 64 8 C. The annealing 
temperature for the remaining cycles is 64°C. Escalating the annealing temperature from TM1 to 
5 TM2 to TM3 in the first three cycles of PCR greatly improves specificity. These annealing 

temperatures are representative, and the skilled artisan understands that the annealing temperatures 
for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 

10 

Purification of Fragment of Interest 

The PCR products are separated from the genomic template DNA. Each PCR product is 
divided into four separate reaction wells of a Streptawell, transparent, High-Bind plate from Roche 
Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 

1 5 Biochemicals Catalog). The first primers contain a 5' biotin tag so the PCR products bound to the 
Streptavidin coated wells while the genomic template DNA does not The streptavidin binding 
reaction is performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each 
well is aspirated to remove unbound material, and washed three times with IX PBS, with gentle 
mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., Biotechniques 

20 10:30-34 (1991); Green et al, Nucl. Acids Res. 18:6163-6164 (1990)). 

Alternatively, the PCR products are placed into a single well of a streptavidin plate to 
perform the nucleotide incorporation reaction in a single well. 

Restriction Enzyme Digestion of Isolated Fragments 
25 The purified PCR products are digested with the restriction enzyme BsmF I (New England 

Biolabs catalog number R0572S), which binds to the recognition site incorporated into the PCR 
products from the second primer. The digests are performed in the Streptawells following the 
instructions supplied with the restriction enzyme. After digestion with the appropriate restriction 
en2yme, the wells are washed three times with PBS to remove the cleaved fragments. 

30 

Incorporation of Labeled Nucleotide 

The restriction enzyme digest described above yields a DNA fragment with a 5 f overhang, 
which contains the locus of interest and a 3 1 recessed end. The 5* overhang functions as a template 
allowing incorporation of a nucleotide or nucleotides in the presence of a DNA polymerase. 
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For each locus of interest, four separate fill in reactions are performed; each of the four 
reactions contains a different fluorescently labeled ddNTP (ddATP, ddTTP, ddGTP, or ddCTP). 
The following components are added to each fill in reaction: 1 ]i\ of a fluorescently labeled ddNTP, 
0.5 \i\ of unlabeled ddNTPs ( 40 nM), which contains all nucleotides except the nucleotide that is 
5 fluorescently labeled, 2 \kl of 10X sequenase buffer, 0.25 \il of Sequenase, and water as needed for a 
20jli1 reaction. The fill are performed in reactions at 40°C for 10 min. Non-fluorescently labeled 
ddNTP are purchased from Fermentas Inc. (Hanover, MD). All other labeling reagents are obtained 
from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). In 
the presence of fluorescently labeled ddNTPs, the 3' recessed end is extended by one base, which 

10 corresponds to the locus of interest. 

A mixture of labeled ddNTPs and unlabeled dNTPs also can be used for the fill-in reaction. 
The "fill in" conditions are as described above except that a mixture containing 40 jiM unlabeled 
dNTPs, 1 \il fluorescently labeled ddATP, 1 \il fluorescently labeled ddTTP, 1 jxl fluorescently 
labeled ddCTP, and 1 nl ddGTP are used. The fluorescent ddNTPs are obtained from Amersham 

15 (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565; Amersham does not 
publish the concentrations of the fluorescent nucleotides). The locus of interest is digested with the 
restriction enzyme BsmF I, which generates a 5* overhang of four bases. If the first nucleotide 
incorporated is a labeled ddNTP, the 3' recessed end is filled in by one base, allowing detection of 
the locus of interest. However, if the first nucleotide incorporated is a dNTP, the polymerase 

20 continues to incorporate nucleotides until a ddNTP is filled in. For example, the first two 

nucleotides may be filled in with dNTPs, and the third nucleotide with a ddNTP, allowing detection 
of the third nucleotide in the overhang. Thus, the sequence of the entire 5* overhang is determined, 
which increases die information obtained from each SNP or locus of interest. This type of fill in 
reaction is especially useful when detecting the presence of insertions, deletions, insertions and 

25 deletions, rearrangements, and translocations. 

Alternatively, one nucleotide labeled with a single dye is used to determine the sequence of 
the locus of interest. See Example 6. This method eliminates any potential errors when using 
different dyes, which have different quantum coefficients. 

After labeling, each Streptawell is rinsed with IX PBS (100 three times. The "filled in" 

30 DNA fragments are released from the Streptawells by digesting with the restriction enzyme EcoRI, 
according to the manufacturer's instructions that are supplied with the enzyme. The digestion is 
performed for 1 hour at 37 °C with shaking at 120 rpm. 
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Detection of the Locus of Interest 

After release from the streptavidin matrix, the sample is loaded into a lane of a 36 cm 5% 
acrylamide (urea) gel (BioWhittaker Molecular Applications, Long Ranger Run Gel Packs, catalog 
number 50691). The sample is electrophoresed into the gel at 3000 volts for 3 min. The gel is run 
5 for 3 hours using a sequencing apparatus (Hoefer SQ3 Sequencer). The incorporated labeled 
nucleotide is detected by fluorescence. 

To determine if any cells contain mutations at codon 1370 of the APC gene when separate 
fill-in reactions are performed, the lanes of the gel that correspond to the fill-in reaction for ddATP 
and ddTTP are analyzed. If only normal cells are present, the lane corresponding to the fill in 

1 0 reaction with ddATP is a bright signal. No signal is detected for the "fill-in" reaction with ddTTP. 
However, if the patient sample contains cells with mutations at codon 1370 of the APC gene, the 
lane corresponding to the fill in reaction with ddATP is a bright signal, and a signal is detected from 
the lane corresponding to the fill in reaction with ddTTP. The intensity of the signal from the lane 
corresponding to the fill in reaction with ddTTP is indicative of the number of mutant cells in the 

15 sample. 

Alternatively, one labeled nucleotide is used to determine the sequence of the alleles at 
codon 1370 of the APC gene. At codon 1370, the normal sequence is AAA, which codes for the 
amino acid lysine. However, a nucleotide substitution has been identified at codon 1370, which is 
associated with colorectal tumors. Specifically, a change from A to T (AAA-TAA) typically is 
20 found at codon 1370, which results in a stop codon. A single fill-in reaction »is performed using 
labeled ddATP, and unlabeled dTTP, dCTP, and dGTP. A single nucleotide labeled with one 
fluorescent dye is used to determine the presence of both the normal and mutant DNA sequence that 
codes for codon 1370. The relevant DNA sequence is depicted below with the sequence 
corresponding to codon 1370 in bold: 

25 

5* CCCAAAAGTCCACCTGA 
3* GGGTTTTCAGGTGGACT 

After digest with BsmF I, the following overhang is produced: 

30 

5' CCC 

3'GGG T T T T 

Overhang position 12 3 4 
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If the patient sample has no cells harboring a mutation at codon 1370, one signal is 
seen corresponding to incorporation of labeled ddATP. 



5* CCC A* 
5 3' GGG T T T T 

Overhang position 12 3 4 

However, if the patient sample has cells with mutations at codon 1370 of the APC 
gene, one signal is seen, which corresponds to the normal sequence at codon 1370, and a second 
10 signal is seen, which corresponds to the mutant sequence at codon 1370. The signals clearly are 
identified as they differ in molecular weight. 

Overhang of normal DNA sequence: CCC 

GGG T T T T 
15 Overhang position 12 3 4 

Normal DNA sequence after fill-in: CCC A* 

GGG T T T T 



20 



Overhang position 1 2 3 4 

Overhang of mutant DNA sequence: CCC 

GGG A T T T 

Overhang position 12 3 4 



25 Mutant DNA sequence after fill-in: CCC T A* 

GGG A T T T 
Overhang position 12 3 4 



Two signals are seen when the mutant allele is present. The mutant DNA molecules are 
30 filled in one base after the wild type DNA molecules. The two signals are separated using any 
method that discriminates based on molecular weight One labeled nucleotide (ddATP) is used to 
detect the presence of both the wild type DNA sequence and the mutant DNA sequence. This 
method of labeling reduces the number of reactions that need to be performed and allows accurate 
quantitation for the number of mutant cells in the patient sample. The number of mutant cells in the 
3 5 sample is used to determine patient prognosis, the degree and the severity of the disease. This 
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method of labeling eliminates the complications associated with using different dyes, which have 
distinct quantum coefficients. This method of labeling also eliminates errors associated with 
pipetting reactions. 

To determine if any cells contain mutations at codon 1302 of the APC gene when separate 
fill-in reactions are performed, the lanes of the gel that correspond to the fill-in reaction for ddTTP 
and ddCTP are analyzed. The normal DNA sequence is depicted below with sequence coding for 
codon 1302 in bold type-face. 

Normal Sequence: 5' ACCCTGCAAATAGCAGAA 
3* TGGGACGTTTATCGTCTT 

After digest, the following 5' overhang is produced: 

5' ACCC 

3'TGGG A C G T 
Overhang position 12 3 4 

After the fill-in reaction, labeled ddTTP is incorporated. 

5' ACCC T* 

3'TGGG A C G T 
Overhang position 12 3 4 

A deletion of a single base of the APC sequence, which typically codes for codon 1302, has 
been associated with colorectal tumors. The mutant DNA sequence is depicted below with the 
relevant sequence in bold: 

Mutant Sequence: 5' ACCCGCAAATAGCAGAA 
3' TGGGCGTTTATCGTCTT 

After digest* 

5'ACC 

3'TGG G C G T 

Overhang position 12 3 4 
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After fill-in: 

5' ACC C* 

3'TGG G C G T 

Overhang position 12 3 4 

If there are no mutations in the APC gene, signal is not detected for the fill in reaction with 
ddCTP*, but a bright signal is detected for the fill-in reaction with ddTTP*. However, if there are 
cells in the patient sample that have mutations in the APC gene, signals are seen for the fill-in 
reactions with ddCTP*and ddTTP*. 

Alternatively, a single fill-in reaction is performed using a mixture containing unlabeled 
dNTPs, fluorescently labeled ddATP, fluorescently labeled ddTTP, fluorescently labeled ddCTP, 
and fluorescently labeled ddGTP. If there is no deletion, labeled ddTTP is incorporated. 

5' ACCC T* 

3'TGGG A C G T 
Overhang position 12 3 4 

However, if the T has been deleted, labeled ddCTP* is incorporated. 

5' ACCC* 

3'TGGG C G T 
Overhang position 12 3 4 

The two signals are separated by molecular weight because of the deletion of the thymidine 
nucleotide. If mutant cells are present, two signals are generated in the same lane but are separated 
by a single base pair (this principle is demonstrated in FIG 9D). The deletion causes a change in 
the molecular weight of the DNA fragments, which allows a single fill in reaction to be used to 
detect the presence of both normal and mutant cells. 

In the above example, methods for the detection of a nucleotide substitution and a small 
deletion are described. However, the methods can be used for the detection of any type of mutation 
including but not limited to nucleotide substitutions (see Table VII), splicing errors (see Table 
Vm), small deletions (see Table EX), small insertions (see Table X), small insertions/deletions (see 
Table XI), gross deletions (see Table XII), gross insertions (see Table XIII), and complex 
rearrangements (see Table XTV). 
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In addition, the above-described methods are used for the detection of any type of disease 
including but not limited to those listed in Table IV. Furthermore, any type of mutant gene is 
detected using the inventions described herein including but not limited to the genes associated with 
the diseases listed in Table IV, BRCA1, BRCA2, MSH6, MSH2, MLH1, RET, PTEN, ATM, H- 
5 RAS, p53, ELAC2, CDH1, APC, AR, PMS2, MLH3, CYP1A1, GSTP1, GSTM1, AXIN2, CYP19, 
MET, NAT1, CDKN2A, NQ01, trc8, RAD51, PMS1, TGFBR2, VHL, MC4R, POMC, NROB2, 
UCP2, PCSK1, PPARG, ADRB2, UCP3, glurl, cart, SORBS1, LEP, LEPR, SIM1, TNF, IL-6, EL- 
1, IL-2, IL-3, ILIA, TAP2, THPO, THRB, NBS1, RBM15, LIF, MPL, RUNX1, Her-2, 
glucocorticoid receptor, estrogen receptor, thyroid receptor, p21, p27, K-RAS, N-RAS, 

10 retinoblastoma protein, Wiskott-Aldrich (WAS) gene, Factor V Leiden, Factor II (prothrombin), 
methylene tetrahydrofolate reductase, cystic fibrosis, LDL receptor, HDL receptor, superoxide 
dismutase gene, SHOX gene, genes involved in nitric oxide regulation, genes involved in cell cycle 
regulation, tumor suppressor genes, oncogenes, genes associated with neurodegeneration, genes 
associated with obesity. Abbreviations correspond to the proteins as listed on the Human Gene 

15 Mutation Database, which is incorporated herein by reference www.arcbive.uwcm.ac.uk./uwcm; 
website address active as of February 12, 2003). 

The above-example demonstrates the detection of mutant cells and mutant alleles from a 
fecal sample. However, the methods described herein are used for detection of mutant cells from 
any biological sample including but not limited to blood sample, serum sample, plasma sample, 

20 urine sample, spinal fluid, lymphatic fluid, semen, vaginal secretion, ascitic fluid, saliva, mucosa 
secretion, peritoneal fluid, fecal sample, body exudates, breast fluid, lung aspirates, cells, tissues, 
individual cells or extracts of the such sources that contain the nucleic acid of the same, and 
subcellular structures such as mitochondria or chloroplasts. In addition, the methods described 
herein are used for the detection of mutant cells and mutated DNA from any number of nucleic acid 

25 containing sources including but not limited to forensic, food, archeological, agricultural or 
inorganic samples. 

The above example is directed to detection of mutations in the APC gene. However, the 
inventions described herein are used for the detection of mutations in any gene that is associated 
with or predisposes to disease (see Table XV). 
30 For example, hypermethylation of the glutathione S-transferase PI (GSTP1) promoter is the 

most common DNA alteration in prostrate cancer. The methylation state of the promoter is 
determined using sodium bisulfite and the methods described herein. 

Treatment with sodium bisulfite converts unmethylated cytosine residues into uracil, and 
leaving the methylated cytosines unchanged. Using the methods described herein, a first and 
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10 



second primer are designed to amplify the regions of the GSTP1 promoter that are often 
methylated. Below, a region of the GSTP1 promoter is shown prior to sodium bisulfite treatment: 

Before Sodium Bisulfite treatment: 
5' ACCGCTACA 
3' TGGCGATCA 

Below, a region of the GSTP1 promoter is shown after sodium bisulfite treatment, PCR 
amplification, and digestion with the type IIS restriction enzyme BsmF I: 

Unmethylated 

5' ACC 

3'TGG U G A T 

Overhang position 12 3 4 

15 Methylated 

5' ACC 

3'TGG C G A T 

Overhang position 1 2 3 4 ; 

20 Labeled ddATP, unlabeled dCTP, dGTP, and dTTP are used to fill-in the 5' overhangs. 

The following molecules are generated: 

Unmethylated 

5' ACC A* 

25 3'TGG U G A T 

Overhang position 12 3 4 

Methylated 

5' ACC G C T A* 

30 3'TGG C G A T 

Overhang position 12 3 4 

Two signals are seen; one corresponds to DNA molecules filled in with ddATP at position 
one complementary to the overhang (unmethylated), and the other corresponds to the DNA 
35 molecules filled in with ddATP at position 4 complementary to the overhang (methylated). The 
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two signals are separated based on molecular weight. Alternatively, the fill-in reactions are 
performed in separate reactions using labeled ddGTP in one reaction and labeled ddATP in another 
reaction. 

The methods described herein are used to screen for prostate cancer and also to monitor the 
5 progression and severity of the disease. The use of a single nucleotide to detect both the methylated 
and unmethylated sequences allows accurate quantitation and provides a high level of sensitivity for 
the methylated sequences, which is a useful tool for earlier detection of the disease. 

The information contained in Tables VII-XIV was obtained from the Human Gene 
Mutation Database. With the information provided herein, the skilled artisan will understand how 
10 to apply these methods for determining the sequence of the alleles for any gene. A large number of 
genes and there associated mutations can be found at the following website: 
www.archive.uwcm.ac.uk./uwcm. 



TABLE VH: NUCLEOTIDE SUBSTITUTIONS 



Codon 


Nucleotide 


Ammo aciu 


PliannfvnA t 

x nenoiype 


99 


CGG-TGG 


Are-Tro 


Adenomatous polyposis coli 


121 


AGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


157 


TGG-TAG 


Trp-Term 


Adenomatous polyposis coli 


159 


TAC-TAG 


Tyr-Term 


Adenomatous polyposis coli 


163 


CAG-TAG 


Gin-Term j 


Adenomatous polyposis coli 


168 


AGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


171 


AGT-ATT 


Ser-Ile 


Adenomatous polyposis coli 


181 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


190 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


202 


GAA-TAA 


GIu-Term 


Adenomatous polyposis coli 


208 


CAG-CGG 


Gln-Arg 


Adenomatous polyposis coli 


208 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


213 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


215 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


216 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


232 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


233 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


247 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


267 


GGA-TGA 


Gly-Term 


Adenomatous polyposis coli 
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278 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


280 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


280 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


283 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


302 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


332 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


358 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


405 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


414 


CGC-TGC 


Arg-Cys 


Adenomatous polyposis coli 


422 


GAG-TAG 


Glu-Term 


Adenomatous polyposis coli 


423 


TGG-TAG 


Trp-Tenn 


Adenomatous polyposis coli 


424 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


433 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


443 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


457 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


473 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


486 


TAC-TAG 


Tyr-Term 


Adenomatous polyposis coli 


499 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


500 


TAT-TAG 


Tyr-Term 


Adenomatous polyposis coli 


541 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


553 


TGG-TAG 


Trp-Term 


Adenomatous polyposis coli 


554 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


564 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


577 


TTA-TAA 


Leu-Term 


Adenomatous polyposis coli 


586 


AAA-TAA 


Lys-Term 


Adenomatous polyposis coli 


592 


TTA-TGA 


Leu-Term 


Adenomatous polyposis coli 


593 


TGG-TAG 


Trp-Term 


Adenomatous polyposis coli 


593 


TGG-TGA 


Trp-Term 


Adenomatous polyposis coli 


622 


TAC-TAA 


Tyr-Term 


Adenomatous polyposis coli 


625 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


629 


TTA-TAA 


Leu-Term 


Adenomatous polyposis coli 


650 


GAG-TAG 


Glu-Term 


Adenomatous polyposis coli 


684 


TTG-TAG 


Leu-Tenn 


Adenomatous polyposis coli 


685 


TGG-TGA 


Trp-Term 


Adenomatous polyposis coli 
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695 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


699 


TGG-TGA 


Tip-Terra 


Adenomatous polyposis coli 


699 


TGG-TAG 


Trp-Term 


Adenomatous polyposis coli 


713 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


722 


AGT-GGT 


Ser-Gly 


Adenomatous polyposis coli 


747 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


764 


TTA-TAA 


Leu-Term 


Adenomatous polyposis coli . 


784 


TCT-ACT 


Ser-Thr 


Adenomatous polyposis coli 


805 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


811 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


848 


AAA-TAA 


Lys-Term 


Adenomatous polyposis coli 


876 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


879 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


893 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


932 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli • 


932 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


935 


TAC-TAG 


Tyr-Term 


Adenomatous polyposis coli 


935 


TAC-TAA 


Tyr-Term 


Adenomatous polyposis coli 


995 


TGC-TGA 


Cys-Term 


Adenomatous polyposis coli 


997 


TAT-TAG 


Tyr-Term 


Adenomatous polyposis coli 


999 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1000 


TAC-TAA 


Tyr-Term 


Adenomatous polyposis coli 


1020 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1032 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1041 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1044 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1045 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1049 


TGG-TGA 


Tip-Term 


Adenomatous polyposis coli 


1067 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1071 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1075 


TAT-TAA 


Tyr-Term 


Adenomatous polyposis coli 


1075 


TAT-TAG 


Tyr-Term 


Adenomatous polyposis coli 


1102 


TAC-TAG 


Tyr-Term 


Adenomatous polyposis coli 


1110 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 
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1114 


CGA-TGA , 


Arg-Term 


Adenomatous polyposis coli 


1123 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1135 


TAT-TAG 


' jr-Term 


Adenomatous polyposis coli 


1152 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1155 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1168 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1175 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1176 


CCT-CTT 


Pro-Leu 


Adenomatous polyposis coli 


1184 


GCC-CCC 


Ala-Pro 


Adenomatous polyposis coli 


1193 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1194 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli , 


1198 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


1201 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


1228 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1230 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1244 


CAA-TAA 


Gln-Tenn 


Adenomatous polyposis coli 


1249 


TGC-TGA 


Cys-Term 


Adenomatous polyposis coli 


1256 


CAA-TAA 


Gin-Term 


Adenomatous polyposis coli 


1262 


TAT-TAA 


Tyr-Term 


Adenomatous polyposis coli 


1270 


TGT-TGA 


Cys-Term 


Adenomatous polyposis coli 


1276 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


1278 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1286 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1289 


TGT-TGA 


Cys-Term 


Adenomatous polyposis coli 


1294 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1307 


ATA-AAA 


Ile-Lys 


Colorectal cancer, predisposition to, association 


1309 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1317 


GAA-CAA 


Glu-Gln 


Colorectal cancer, predisposition to | 


1328 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1338 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1342 


TTA-TAA 


Leu-Term 


Adenomatous polyposis coli 


1342 


TTA-TGA 


Leu-Term 


Adenomatous polyposis coli 


1348 


AGG-TGG 


Arg-Trp 


Adenomatous polyposis coli 


1357 


GGA-TGA 


Gly-Term 


Adenomatous polyposis coli 
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1367 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1370 


AAA-TAA 


Lys-Term 


Adenomatous polyposis coli 


1392 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1392 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


1397 


GAG-TAG 


Glu-Term 


Adenomatous polyposis coli 


1449 


AAG-TAG 


Lys-Term 


Adenomatous polyposis coli 


1450 


CGA-TGA 


Arg-Term 


Adenomatous polyposis coli 


1451 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1503 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1517 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1529 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1539 


TCA-TAA 


Ser-Term 


Adenomatous polyposis coli 


1541 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


1564 


TTA-TAA 


Leu-Term 


Adenomatous polyposis coli 


1567 


TCA-TGA 


Ser-Term 


Adenomatous polyposis coli 


1640 


CGG-TGG 


Arg-Trp 


Adenomatous polyposis coli 


1693 


GAA-TAA 


Glu-Term 


Adenomatous polyposis coli 


1822 


GAC-GTC 


Asp-Val 


Adenomatous polyposis coli, association with ? 


2038 


CTG-GTG 


Leu-vai 


ivaenomaioua poiypuaia uui-i 


2040 


CAG-TAG 


Gin-Term 


Adenomatous polyposis coli 


2566 


AGA-AAA 


Arg-Lys 


Adenomatous polyposis coli 


2621 


TCT-TGT 


Ser-Cys 


Adenomatous polyposis coli 


2839 


CTT-TTT 


Leu-Phe 


Adenomatous polyposis coli 



TABLE VIH: NUCLEOTIDE SUBSTITUTIONS 



Donor/ 
Acceptor 


Relative 
location 


Substitution 


Phenotype 


ds 


-1 


G-C 


Adenomatous polyposis coli 


as 


-1 


G-A 


Adenomatous polyposis coli 


as 


-1 


G-C 


Adenomatous polyposis coli 


ds 


+2 


T-A 


Adenomatous polyposis coli 


as 


-1 


G-C 


Adenomatous polyposis coli 


as 


-1 


G-T 


Adenomatous polyposis coli 


as 


-1 


G-A 


Adenomatous polyposis coli 
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as 


-2 


A-C 


Adenomatous polyposis coli 


as 


-5 


A-G 


Adenomatous polyposis coli 


ds 


+3 


A-C 


Adenomatous polyposis coli 


as 


-1 


G-A 


Adenomatous polyposis coli 


ds 


+1 


G-A 


Adenomatous polyposis coli 


as 


-1 


G-T 


Adenomatous polyposis coli 


ds 


+1 


G-A 


Adenomatous polyposis coli 


as 


-1 


G-A 


Adenomatous polyposis coli 


ds 


+1 


G-A 


Adenomatous polyposis coli 


ds 


+3 


A-G 


Adenomatous polyposis coli 


ds 


+5 


G-T 


Adenomatous polyposis coli 


as 


-1 


G-A 


Adenomatous polyposis coli 


as 


-6 


A-G 


Adenomatous polyposis coli 


as 


-5 


A-G 


Adenomatous polyposis coli 


as 


-2 


A-G 


Adenomatous polyposis coli 


ds 


+2 


T-C 


Adenomatous polyposis coli 


as 


-2 


A-G 


Adenomatous polyposis coli 


ds 


+1 


G-A 


Adenomatous polyposis coli 


ds 


+1 


G-T 


Adenomatous polyposis coli 


ds 


+2 


T-G 


Adenomatous polyposis coli 



TABLE DC: APC SMALL DELETIONS 

Bold letters indicate the codon. Undercase letters represent the deletion. Where deletions extend 
beyond the coding region, other positional information is provided. For example, the abbreviation 
5' UTR represents 5* untranslated region, and the abbreviation E6I6 denotes exon 6/intron 6 
boundary. 



Location/ 
codon 


Deletion 


Phenotype 


77 


TTAgataGCAGTAATTT 


Adenomatous 
polyposis coli 


97 


GGAAGccgggaagGATCTGTATC 


Adenomatous 
polyposis coli 
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138 


GAGAaAGAGAGJE3B_GTAA 


Adenomatous 
polyposis coli 


139 


AAAGAgag_E3I3J3taacttttct 


Thyroid cancer 


139 


AAAGagagJE3 B J3T AACTTTTC 


Adenomatous 
polyposis coli 


142 


TTTTAAAAAAaAAAAATAG I3E4 GTCA 


Adenomatous 
polyposis coli 


144 


AAAATAG_BE4_GTCatTGCTTCTTGC 


Adenomatous 
polyposis coli 


149 


GACAaaGAAGAAAAGG 


Adenomatous 
polyposis coli 


149 


GACAAaeaaGAAAAGGAAA 


Adenomatous 
polyposis coli 


155 


AGGAA A AAAGActffetATTACGCTCA 


Adenomatous 
polyposis coli 


169 


AAAAGA A ATAGateffTCTTCCTTTA 


Adenomatous 
polyposis coli 


172 


AGATAGT A CTTcCTTTAACTGA 


Adenomatous 
polyposis coli 


179 


TCCTTacaaACAGATATGA 


Adenomatous 
polyposis coli 


135 


1 

ACCaGAAGGCAATT 


Adenomatous 
polyposis coli 


196 


ATCAGagTTGCGATGGA 


Adenomatous 
polyposis coli 


213 


CGAGCaCAG_E5I5_GTAAGTT 


Adenomatous 
polyposis coli 


298 


CACtcTGCACCTCGA 


Adenomatous 
polyposis coli 


329 


GATaTGTCGCGAAC 


Adenomatous 
polyposis coli 


365 


AAAGActCTGTATTGTT 


Adenomatous 
polyposis coli 


397 


GACaaGAGAGGCAGG 


Adenomatous 
polyposis coli 
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427 


CATGAacCAGGCATGGA 


Adenomatous 
polyposis coli 


428 


GAACCaGGCATGGACC 


Adenomatous 
polyposis coli 


436 


AATCCaa E9I9 ffTATGTTCTCT 


Adenomatous 
polyposis coli 


440 


nCTCCtOTTO A AP ATP 


Adenomatous 
polyposis coli 


455 
*tj j 


AAA PTtTP ATTTG ATG 

AAAv 111 *~>f\ All VXtt. X VJ 


Adenomatous 
polyposis coli 


455 


A A APtttcaTTTG ATG A A G 


Adenomatous 
polyposis coli 


472 


PTAeAGGPPATTGP 

i 


Adenomatous 
polyposis coli 


47? 


TAAATTAG T10F11 GGaG A PT A P A GGP 


Adenomatous ~ 
polyposis coli 


478 


TT ATtGP A A GTGG AP 


Adenomatous 
polyposis coli 


486 


TACGffGCTTACTAAT 


Adenomatous 
polyposis coli 


494 


A GT A Tf A P A PT A A G A P 


Adenomatous 
polyposis coli 


495 


ATT APacT A AG APG ATA 

X\. X X nvdv A AnUAv VXrV A AV 


Adenomatous 
polyposis coli 


497 


CTAaGAPGATATGP 


Adenomatous 
polyposis coli 


520 


TGCTCtaTGAAAGGCTG 


Adenomatous 
polyposis coli 


526 


ATGA Gaecactt&teGCCCA ACTA A 


Adenomatous 
polyposis coli 


539 


GACTTaCAGCAG_E12I12_GTAC 


Adenomatous 
polyposis coli 


560 


AAAAAgaCGTTGCGAGA 


Adenomatous 
polyposis coli 


566 


GTTGgaagtGTGAAAGCAT 


Adenomatous 
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polyposis coli 


570 


AAAGCaTTGATGGAAT 

****** V V-*fc* A A V^XA A w Vp>i X4 *. -i> 


Adenomatous 
polyposis coli 


577 


TTAGaaetTAAAAAG E13I13 GTA 


Adenomatous 
polyposis coli 


584 


ACCCTcAAAAGCGTAT 


Adenomatous 
polyposis coli 


591 


GCCTtATGGAATTTG 


Adenomatous 
polyposis coli 


608 


GCTeTAGATGGTGC 


Adenomatous 
polyposis coli 


617 


(ZT^&QcactrttacttoccGGACiCC A G AC* 

vj jl m. ggv/M^tvtiaoiuzv^vj vj*rvvj v»*v»»xvvjjrvv_/ 


Adenomatous 
polyposis coli 


620 


CTTACttacCGGAGCCAGA 


Adenomatous 
polyposis coli 


621 


ACTTaCCGG AGCC AG 


Adenomatous 
polyposis coli 


624 


AGCcaGACAAACAfT 


Adenomatous 
polyposis coli 


624 


AGffaonc A A AP APTTTA 


Adenomatous 
polyposis coli 


626 


ACAaacaCTTTAGCCAT 

rA w**Hiu/ttv i a xiiVJwni 


Adenomatous 
polyposis coli 


629 


TTAGCcATTATTGAAA 

M. A 1*V WJ * A A il A A ^JJJkU *JT *. 


Adenomatous 
polyposis coli 


635 


GGAGeTGGGATATTA 

\jvp*»wg* *i* 


Adenomatous 
polyposis coli 


638 


ATATtACGGAATGTG 


Adenomatous 
polyposis coli 


639 


TTACGgAATGTGTCCA 


Adenomatous 
polyposis coli 


657 


AGAgaGAACAACTGT 


Adenomatous 
polyposis coli 


659 


TATTTCAGJ14E15_GCaaatcctaagagagAACA^ 


Adenomatous 
polyposis coli 
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66(1 


A APTfftPT A P A A A PTT 


Adenomatous 
polyposis coli 


665 


TTAttA C A A PA PTTA 


Adenomatous 
polyposis coli 


uuo 


r ACtt A AAA TPTP AT 


Adenomatous 
polyposis coli 


A7^ 
Of J 


A m>t<r«»n« ata otP A f»T A A TOP A 


Adenomatous 
polyposis coli 


76R 


rArTTflTrAnA a aptt 


Adenomatous 
polyposis coli 


7£Q 


TTATpAHA A AfTTTT 


Adenomatous 
polyposis coli 


77H 


1 tAU/\a/\L 1111 uALA 


Adenomatous 
polyposis coli 


/oU 


A\y 1 t^C W\AVjrVJL,/\ 1 1 


Adenomatous 
polyposis coli 


709 


A AHPn A A/TTPTPTAT 


Adenomatous 
polyposis coli 


79? 


A A CZC A a shCtTCTCT A TftCr 


Adenomatous 
polyposis coli 


7QT 


in A A A crTPTPT ATHPrT 


Adenomatous 
polyposis coli 


7QR 


^ATTntn'riTrrnAPA 

vx/VA 1 alKJ 1 111 1 VJ/\w\ 


Adenomatous 
polyposis coli 


Rft9 


fJAP APrftatf»o»<\5vtfiATfrATA ATA 


Adenomatous 
polyposis coli 


RfK 

OUJ 


r^APatPrATfiATA ATA 


Adenomatous 
polyposis coli 


Oil 


TP A GflftflpTTTT A A T A PT 


Adenomatous 
polyposis coli 


825 


TATtTGAATACTAC 


Adenomatous 
polyposis coli 


827 


AATAcTACAGTGTTA 


Adenomatous 
polyposis coli 


830 


GTGTTacccagctcctctTCATCAAGAG 


Adenomatous 
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polyposis coli 


833 


AGCTCcTCTTCATCAA 


Adenomatous 
polyposis coli 


836 


TCATcAAGAGGAAGC 


Adenomatous 
polyposis coli 


848 


AAAGAtaGAAGTTTGGA 

) 


Adenomatous 
polyposis coli 


848 


AAAGataeaacTTTGGAGAGA 


Adenomatous 
polyposis coli 


855 


GAACgCGGAATTGGT 


Adenomatous 
polyposis coli 


856 


CGCGeaattGGTCTAGGCA 


Adenomatous 
polyposis coli 


856 


CGCGeAATTGGTCTA 


Adenomatous 
polyposis coli 


879 


CAGaTCTCCACCAC 


Adenomatous 
polyposis coli 


902 


GAAGAcaeaAGTTCTGGGT 


Adenomatous 
polyposis coli 


907 


GGGTcTACCACTGAA 


Adenomatous 
polyposis coli 


915 


GTGACaGATGAGAGAA 


Adenomatous 
polyposis coli 


929 


CATACacatTCAAACACTT 


Adenomatous 
polyposis coli 


930 


ACACAttcaAACACTTACA 


Adenomatous 
polyposis coli 


931 


CATtCAAACACTTA 


Adenomatous 
polyposis coli 


931 


CATTcAAACACTTAC 


Adenomatous 
polyposis coli 


933 


AACacttACAATTTCAC 


Adenomatous 
polyposis coli 


935 


TACAatttcactAAGTCGGAAA 


Adenomatous 
polyposis coli 
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937 


TTCActaaGTCGG A A A AT 


Adenomatous 
polyposis coli 


939 


AAGtceeAAAATTCAAA 


Adenomatous 
polyposis coli 


946 


ACATeTTCTATGCCT 


Adenomatous 
polyposis coli 


954 


TTAGaaTACAAGAGAT 


Adenomatous 
polyposis coli 


961 


AATff AT AGTTT A A A 


Adenomatous 
polyposis coli 


963 


A GTTTa A A T A GTGTf* A 


Adenomatous 
polyposis coli 


964 


TTAflatflGTGTP A OTA fi 

X A -riiXaUlUX VJX V^/WJ X/\VJ 


Adenomatous 
polyposis coli 


973 


T ATGff T A A A A G A GGT 


Adenomatous 
polyposis coli 


974 


GGTAAaAG AGGTr A A A 


Adenomatous 
polyposis coli 


975 


AAAAgaGGTCAAATGA 


Thyroid cancer 


992 


AGTAAgTTTTGCAGTT 


Thyroid cancer 


993 


AACtttocafftaTGGTr A ATA P 


Adenomatous 
polyposis coli 


999 


CAAtacccacrCCGACCTAGC 


Adenomatous 
polyposis coli 


1023 


ACACcAATAAATTAT 


Adenomatous 
polyposis coli 


1030 


AAAtATTCAGATGA 

*»*»-fxwk x x v^-Tv vj.iv. x 


Adenomatous 
polyposis coli 


1032 


TCAGatgagCAGTTGAACT 


Adenomatous 
polyposis coli 


1033 


GATGaGCAGTTGAAC 


Adenomatous 
polyposis coli 


1049 


TGGGcAAGACCCAAA 


Adenomatous 
polyposis coli 
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1054 


CACAtaataGAAGATGAAA 


Adenomatous 
polyposis coli 


1055 


ATAAtaeaaGATGAAATAA ' 


Adenomatous 
polyposis coli 


1056 


ATAGAaGATGAAATAA 


Adenomatous 
polyposis coli 


1060 


ATA A A flcaaaGTGA HP A A A G 


Adenomatous 
polyposis coli 


1061 


A A AcaaaGTG A GP A A AG 


Adenomatous 
polyposis coli 




AAA Can A GTG A GP AAA 


Adenomatous 
polyposis coli 


106? 


PAAAataflGPAA AGAPA A 


Adenomatous 
polyposis coli 


1065 

IvVJJ 


PAAAGarAATPA AGGA A 


Adenomatous 
polyposis coli 


1067 


P A AtcaaGG A ATP A A A G 

— - r — ' 


Adenomatous 
polyposis coli 


1071 


P A A A at AC A APTTA TP 


Adenomatous 
polyposis coli 


1079 


APTGaaAGPAPTGATG 


Adenomatous 
polyposis coli 


1082 


APTG Atff ATA A AP A PPT 


Adenomatous 
polyposis coli 


1084 


GATaaacACCTPAAGTT 

v*xx x (Uulv/l^v X ^/VrlvJ X X 


Adenomatous 
polyposis coli 


1086 


CAPPtc A AGTTPP A AP 


Adenomatous 
polyposis coli 


1093 


TTTGgACAGCAGGAA 


Adenomatous 
polyposis coli 


1098 


TGTgtTTCTCCATAC 


Adenomatous 
polyposis coli 


1105 


CGGgGAGCCAATGG 


Thyroid cancer 


1110 


TCAGAaACAAATCGAG 


Adenomatous 
polyposis coli 
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1121 


ATTAAteaa A ATOTA AGCC 


Adenomatous 
polyposis coli 


1131 

X XJ X 


CAAoAAGATGAPTA 


Adenomatous 
polyposis coli 




OAfTAtGA AG ATGATA 


Adenomatous 
polyposis coli 


1157 

X XJ / 


CJATafltflaGPPT APP A AT 


Adenomatous 
polyposis coli 


1146 


CCJTTAoTPTG A AG A AG 


Adenomatous 
polyposis coli 


X XJ^T 


lyAAUaagaaUAunu/iLVnA 


Adenomatous 
polyposis coli 


1 


fiA AnaamjfiAfiApp a apa 

IjrAAUaagaLrAUAv^AAl^A 


Adenomatous 
polyposis coli 


1 1 JO 


fA AcracTaflAPP A APA A A 


Adenomatous 
polyposis coli 


1 

1 IDO 


OA A (tci rrna A PHTP ATYTTG 

vrAAgagaaAv^o i 1 vj i u 


Adenomatous 
polyposis coli 


1 1 7» 

11 /o 


A TTA fa ottfa AAA T A Tf?PP A 


Adenomatous 
polyposis coli 


noi 


TTA AaATATPiPPAPA 


Adenomatous 
polyposis coli 


1184 


ftPPacfl aaT ATTPPTTP A 


Adenomatous 
polyposis coli 


1185 


AC A cmTA TTPPTTP A 


Adenomatous 
polyposis coli 


1190 


TCACAffAAACAGTPAT 


Adenomatous 
polyposis coli 


1192 

X x^x* 


AAAcaGTCATTTTPA 


Adenomatous 
polyposis coli 


1198 


TCAaaGAGTTCATCT 


Adenomatous 
polyposis coli 


1207 


AAAAcCGAACATATG 


Adenomatous 
polyposis coli 


1208 ACCgaacATATGTCTTC 


Adenomatous 
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* 


polyposis coli 


1210 


CATatGTPTTP A AGP 


Adenomatous 
polyposis coli 




PP A A GfTPTOP APA OA 
LLAA\JT 1 L 1 VJ L/ A L AU A 


Adenomatous 
polyposis coli 


10AG 


Tt~* f A oaOTTTr" 1 T v TV' ,, T A 
luLnoaUl 1 1L1 ILIA 


Adenomatous 
polyposis coli 


lZjy 


A J. ACaVjrAL/1 1AJL lvJl 


Adenomatous 
polyposis coli 


lZOU 


V> Air ALttA X ivjlvji. AVj A 


Adenomatous 
polyposis coli 




CCAaiAlOl ill 1L 


Adenomatous 
polyposis coli 




A 1~"T+r* A XT A TP A TC* 
A\3 ltLA 1 1 A 1 LA 1 L 


Adenomatous 
polyposis coli 




rArnAanrAnATTPTn 


Adenomatous 
polyposis coli 


1 ^ni 


APPPfOPA A ATAHPA 
ALtLtULAAA I AULA 


Adenomatous 
polyposis coli 


1 JUO 


PAA AfooaArJA A A AO ATT 
Vj-AAAlaaaAvjAAAAVjA I i 


Adenomatous 
polyposis coli 


T3rt7 


A TP A a A AOAAAAOAT 
A A A3AAVJAAAAVJ A I 


Adenomatous 
polyposis coli 




A A AffnaaA/^rATTnTrA AP 


Adenomatous 
polyposis coli 


1 'JfiR 
1 jUO 


A A AO A QoaftoTTrjrJA A PT A O 

AAAUrAaaaga 1 1 uuAAU 1 avj 


Adenomatous 
polyposis coli 


1 J 10 


n A TPnTOTO A OPO A A 
Lr A 1 LC lul uAVJLuAA 


Adenomatous 
polyposis coli 


1320 


GTGAGcGAAGTTCCAG 


Adenomatous 
polyposis coli 


1323 


GTTCcAGCAGTGTCA 


Adenomatous 
polyposis coli 


1329 


CACCctagaaccAAATCCAGCA 


Adenomatous 
polyposis coli 
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1336 


AGACteCAGGGTTCTA 


Adenomatous 
polyposis coli 


1338 


CAGeGTTCTAGTTT 


Adenomatous 
polyposis coli 


1340 


TCT A eTTTATCTTC A 


Adenomatous 
polyposis coli 


1342 


TTATeTTPAGAATPA 


Adenomatous 
polyposis coli 


1352 


fJTTffA ATTTTPTTP 


Adenomatous 
polyposis coli 


1361 


CCCTcCAAAAGTGGT 


Adenomatous 
polyposis coli 


1364 


AGTeotePTP AG AP APP 


Adenomatous 
polyposis coli 


1171 

U 1 X 


AflTPPfiePTOA AP APT A 


Adenomatous 
polyposis coli 


1372 


CCACCtGAACACTATCl 


Adenomatous 
polyposis coli 


1376 


TATGttP A GGA GA PPP 


Adenomatous 
polyposis coli 


1394 


^ AT A PtTTTG A G A OTP 


Adenomatous 
polyposis coli 


1401 


ATTGCcAGCTCCGTTC 


Adenomatous 
polyposis coli 


1415 


AGTGGcATTATAAGCC 


Adenomatous 
polyposis coli 


1426 


AGCCcTGGACAAACC 


Adenomatous 
polyposis coli 


1427 


CCTGGaCAAACCATGC 


Adenomatous 
polyposis coli 


1431 


ATGCcACCAAGCAGA 


Adenomatous 
polyposis coli 


1454 


AAAAAtAAAGCACCTA 


Adenomatous 
polyposis coli 


1461 


GAAaAGAGAGAGAG 


Adenomatous 
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polyposis coli 


1463 


AGAgagaGTGGACCTAA 


Adenomatous 
polyposis coli 


1464 

i 


GAGAgTGGACCTAAG 


Adenomatous 
polyposis coli 


1464 


GAGAgtGGACCTAAGC 


Adenomatous 
polyposis coli 


1464 


GAGagTGGACCTAAG 


Adenomatous 
polyposis coli 


1492 


GCCaCGGAAAGTAC 

X* v wMwVp/\/i AX At AV^ AX *> 


Adenomatous 
polyposis coli 


1493 


ACGGAaAGTACTCCAG 


Adenomatous 
polyposis coli 


1497 


CCAeATGGATTTTC 


Adenomatous 
polyposis coli 


1503 


TCAtccaGCCTGAGTGC 


Adenomatous 
polyposis coli 


1522 


TTAaeaataaTGCCTCCAGT 


Adenomatous 
polyposis coli 


1536 


GAAACagAATCAGAGCA 


Adenomatous 
polyposis coli 


1545 | 


TCAAAtgaaaACCAAGAGAA 


Adenomatous 
polyposis coli 


1547 


GAAaACCAAGAGAA 


Adenomatous 
polyposis coli 


1550 


GAGAaagaGGCAGAAAAA 


Adenomatous 
polyposis coli 


1577 


GAATgtATTATTTCTG 


Adenomatous 
polyposis coli 


1594 


CCAGCcCAGACTGCTT 


Adenomatous 
polyposis coli 


1596 


CAGACtGCTTCAAAAT 


Adenomatous 
polyposis coli 


1823 


TTCAaTGATAAGCTC 


Adenomatous 
polyposis coli 
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1859 


AATGAttctTTGAGTTCTC 


Adenomatous 
polyposis coli 


1941 


CCAGAcagaGGGGCAGCAA 


Desmoid tumours 


1957 


GAAaATACTCCAGT 


Adenomatous 
polyposis coli 


1080 


AAfaATAAAGAAAA 


Adenomatous 
polyposis coli 


iyoj 


OAAPPtATrAAAGAGA 


Adenomatous 
polyposis coli 




rrTaTPA A AG AO AP 


Adenomatous 
polyposis coli 


1770 


OAACcAAGTAAAPPT 


Adenomatous 
polyposis coli 


2044 


AGCTCcGCAATGCCAA 


Adenomatous 
polyposis coli 


2556 


TCATCccttcctcGAGTAAGCAC 


Adenomatous 
polyposis coli 


2643 


CTAATttatCAAATGGCAC 


Adenomatous 
polyposis coli 



TABLE X: SMALL INSERTIONS 



Codon 


Insertion 


Phenotype 


157 


T 


Adenomatous polyposis coli 


170 


AGAT 


Adenomatous polyposis coli 


172 


T 


Adenomatous polyposis coli 


199 


G 


Adenomatous polyposis coli 


243 


AG 


Adenomatous polyposis coli 


266 


T 


Adenomatous polyposis coli 


357 


A 


Adenomatous polyposis coli 


405 


C 


Adenomatous polyposis coli 


413 


T 


Adenomatous polyposis coli 


416 


A 


Adenomatous polyposis coli 
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457 


G 


Adenomatous polyposis coli 


473 


A 


Adenomatous polyposis coli 


503 


ATTC 


Adenomatous polyposis coli 


519 


C 


Adenomatous polyposis coli 


528 


A 


Adenomatous polyposis coli 


561 


A 


Adenomatous polyposis coli 


608 


A 


Adenomatous polyposis coli 


620 


CT 


Adenomatous polyposis coli 


621 


A 


Adenomatous polyposis coli 


623 


TTAC 


Adenomatous polyposis coli 


627 


A 


Adenomatous polyposis coli 


629 


A 


Adenomatous polyposis coli 


636 


GT 


Adenomatous polyposis coli 


639 


A 


Adenomatous polyposis coli 


704 


T 


Adenomatous polyposis coli 


740 


ATGC 


Adenomatous polyposis coli 


764 


T 


Adenomatous polyposis coli 


779 


TT 


Adenomatous polyposis coli 


807 


AT 


Adenomatous polyposis coli 


827 


AT 


Adenomatous polyposis coli 


831 


A 


Adenomatous polyposis coli 


841 


CTTA 


Adenomatous polyposis coli 


865 


CT 


Adenomatous polyposis coli 


865 


AT 


Adenomatous polyposis coli 


900 


TG 


Adenomatous polyposis coli 


921 


G 


Adenomatous polyposis coli 


927 


A 


Adenomatous polyposis coli 


935 


A 


Adenomatous polyposis coli 


936 


C 


Adenomatous polyposis coli 


975 


A 


Adenomatous polyposis coli 


985 


T 


Adenomatous polyposis coli 


997 


A 


Adenomatous polyposis coli 


1010 


TA 


Adenomatous polyposis coli 


1085 


C 


Adenomatous polyposis coli 
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1085 


AT 


Adenomatous polyposis coli 


1095 


A 


Adenomatous polyposis coli 


1100 


GTTT 


Adenomatous polyposis coli 


1107 


GGAG 


Adenomatous polyposis coli 


1120 


G 


Adenomatous polyposis coli 


1166 


A 


Adenomatous polyposis coli 


1179 


T 


Adenomatous polyposis coli 


1187 


A 


Adenomatous polyposis coli 


1211 


T 


Adenomatous polyposis coli 


1256 


A 


Adenomatous polyposis coli 


1265 


T 


Adenomatous polyposis coli 


1267 


GATA 


Adenomatous polyposis coli 


1268 


T 


Adenomatous polyposis coli 


1301 


A 


Adenomatous polyposis coli 


1301 


C 


Adenomatous polyposis coli 


1323 


A 


Adenomatous polyposis coli 


1342 


T 


Adenomatous polyposis coli 


1382 


T 


Adenomatous polyposis coli 


1458 


GTAG 


Adenomatous polyposis coli 


1463 


AG 


Adenomatous polyposis coli 


1488 


T 


Adenomatous polyposis coli 


1531 


A 


Adenomatous polyposis coli 


1533 


T 


Adenomatous polyposis coli 


1554 


A 


Adenomatous polyposis coli 


1555 


A 


Adenomatous polyposis coli 


1556 


T 


Adenomatous polyposis coli 


1563 


GACCT 


Adenomatous polyposis coli 


1924 


AA 


Desmoid tumours 



TABLE XI: SMALL INSERTIONS/DELETIONS 



Location/ 
codon 


Deletion 


Insertion 


Phenotype 


538 


GAAGAcTTACAGCAGG 


gaa 


Adenomatous polyposis 
coli 


620 


CTTACttaCCGGAGCCAG 


ct 


Adenomatous polyposis 
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coli 


728 


AATctcatGGCAAATAGG 


ttgcagctttaa 


Adenomatous polyposis 
coli 


971 


GATGgtTATGGTAAAA 


taa 


Adenomatous polyposis 
coli 


TABLE XH: GROSS DELETIONS 



2 kb including ex. 1 1 


rA.UCliUlIld.LUU.b pUiypUbld V>UI1 


3 kb I10E1 1-1.5 kb to I12E13-170 bp 


Adenomatous polyposis coli 


335 bp nt 1409-1743 ex. 11-13 


Adenomatous polyposis coli 


6 kb incl. ex. 14 


Adenomatous polyposis coli 


817 bp I13E14-679 to I13E14+138 


Adenomatous polyposis coli 


ex. 11-15M 


Adenomatous polyposis coli 


ex. 11-3'UTR 


Adenomatous polyposis coli 


ex. 15A-ex. 15F 


Adenomatous polyposis coli 


ex. 4 


Adenomatous polyposis coli 


ex. 7, 8 and 9 


Adenomatous polyposis coli 


ex. 8 to beyond ex. 15F 


Adenomatous polyposis coli 


ex. 8 -ex. 15F 


Adenomatous polyposis coli 


ex.9 


Adenomatous polyposis coli 


>10mb (del 5q22) 


Adenomatous polyposis coli 



TABLE XIII: GROSS INSERTIONS AND DUPLICATIONS 



Description 


Phenotype 


Insertion of 14 bp nt 3816 


Adenomatous polyposis coli 


Insertion of 22 bp nt. 4022 


Adenomatous polyposis coli 


Duplication of 43 bp cd. 1295 


Adenomatous polyposis coli 


Insertion of 337 bp of Alu I sequence cd. 1526 


Desmoid tumours 



170 



WO 2004/078994 



PCT7US2004/006337 



TABLE XIV: COMPLEX REARRANGEMENTS (INCLUDING INVERSIONS) 



A-T nt 4893 Q1625H, Del C nt. 4897 cd. 1627 


Adenomatous polyposis coli 


Del 1099 bp I13E14-728 to E14I14+156, ins 126 bp 


Adenomatous polyposis coli 


Del 1601 bp E14I14+27 to E14I14+1627, ins 180 bp 


Adenomatous polyposis coli 


Del 310 bp, ins. 15 bp nt. 4394, cd 1464 


Adenomatous polyposis coli 


Del A and Ted. 1395 


Adenomatous polyposis coli 


Del TC nt. 4145, Del TGT nt. 4148 


Adenomatous polyposis coli 


Del. T, nt. 983, Del. 70 bp, nt. 985 


Adenomatous polyposis coli 


Del. nt. 3892-3903, ins ATTT 


Adenomatous polyposis coli 


TABLE XV: DIAGNOSTIC APPLICATIONS 



Cancer Type 


Marker 


Application 


Reference 


Breast 


Her2/Neu 


Using methods described herein, 


D. Xie etal.,J. 




L/eieciion — 


design second primer such that after 


Natl. Lancer 




polymorphism at 


PCR, and digestion with restriction, 


Institute^!, 




codon 655 


enzyme, a 5' overhang containing 


412 (2000) 




(GTC/valine to 


DNA sequence for codon 655 of 






ATC/isoleucine 


Her2/Neu is generated. 


K.S. Wilsons 




[Val(655)Ile]) 




al.,Am. J. 






Her2/Neu can be detected and 


PatholtWhU 






quantified as a possible marker for 


71 (2002) 






breast cancer. Methods described 








herein can detect both mutant allele 


L. Newman, 






and normal allele, even when mutant 


Cancer 






allele is small fraction of total DNA. 


Controls, 473 








(2002) 






Herceptin therapy for breast cancer is 








based upon screening for Her2. The 








earlier the mutant allele can be 








detected, the faster therapy can be 








provided. 
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Breast/Ovarian 


Hypermethylation 


Methods described herein can be used 


M.Esteller et 




ofBRCAl 


to differentiate between tumors 


al, New 






resulting from inherited BRCA1 


England M 






mutations and those from non- 


Med 9 344, 539 






inherited abnormal methylation of 


(2001) 






the gene 




Bladder 


Microsatellite 


Methods described herein can be 


W.G. Bas et 




analysis of free 


applied to microsatellite analysis and 


al t Clinical 




tumor DNA in 


FGFR3 mutation analysis for 


Cancer 




Urine, Serum and 


detection of bladder cancer. Methods 


Res.,9,257 




Plasma 


described herein provide a non- 


(2003) 






invasive method for detection of . 








bladder cancer. 


M. Utting et 








al., Clincal 








Cancer Res., 








8,35 (2002) 






i 


L. Mao, 








D.Sidransky et 








al, 








Science 9 271, 








669(1996) 


Lung 


Microsatellite 


Methods described herein can be used 


T.Liloglou et 




analysis of DNA 


to detect mutations in sputum 


al, Cancer 




from sputum 


samples, and can markedly boost the 


Research,61, 






accuracy of preclinical lung cancer 


1624, (2001) 






screening 










M. Tockman et 








al, Cancer 








Control,!, 19 








(2000) 








Field etal, 








Cancer 








Research 9 59, 
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2690 (1999) 


Cervical 


Analysis of HPV 
genotype 


Methods described herein can be used 
to detect HPV genotype from a 
cervical smear preparation. 


N. Munoz et 
al., New 
England Jnl 
Med., 348, 518 
(2003) 


Head and 
Neck 


Tumor specific 
alterations in 
exfoliated oral 
mucosal cells 
(microsatellite 
markers) 


Methods described herein can be used 
to detect any of 23 microsatellite 
markers, which are associated with 
Head and Neck Squamous Cell 
Carcinoma (HNSCC). 


M. Spafford et 
al Clinical 
Cancer 

Research,!!* 
607 (2001) 

A. El-Naggar et 
al. t J. Mol 
Diag.,3,164 
(2001) 


Colorectal 


Screening for 
mutation in K-ras2 
and APC genes. 


Methods described herein can be used 
to detect K-ras 2 mutations, which ' 
can be used as a prognostic indicator 
for colorectal cancer. 

APC (see Example 5). 


B. Ryan et al 

Gttf,52,101 

(2003) 


Prostate 


GSTP1 

Hypermethylation 


Methods described herein can be used 
to detect GSTP1 hypermethylation in 
urine from patients with prostate 
cancer; this can be a more accurate 
indicator than PSA. 


P. Caims et al. 
Clin. Can. 
Res.,7,2727 
(2001) 



HIV 



Antiretroviral 
resistance 


Screening 
individuals for 
mutations in HIV 
virus -e.g. 154V 
mutation or 


Methods described herein can be used foi 
detection of mutations in the HTV virus. 
Treatment outcomes are improved in 
individuals receiving anti-retroviral theraj 
based upon resistance screening. 


J. Durant et al. 

The 

^Lance&SZ, 
2195(1999) 
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CCR5A32 
allele. 


♦ 




Cardiology 


Congestive 
Heart Failure 


Synergistic 

polymorphisms 

of betal and 

alpha2c 

adrenergic 

receptors 


Methods described herein can be used 
to genotype these loci and may help 
identify people who are at a higher risk 
of heart failure. 


K.Small etal 
New Eng. Jnl 
Med, 347,1135 
(2002) 



5 

EXAMPLE 8 

Single nucleotide polymorphisms (SNPs) represent the most common form of sequence 
variation; three million common SNPs with a population frequency of over'5% have been estimated 
to be present in the human genome. A genetic map using these polymorphisms as a guide is being 

10 developed (http://research.marshfieldclinic.org/genetics/; internet address as of February 13, 2003). 

The allele frequency varies from SNP to SNP; the allele frequency for one SNP may be 
50:50, while the allele frequency for another SNP may be 90:10. The closer the allele frequency is 
to 50:50, the more likely any particular individual will be heterozygous at that SNP. The SNP 
consortium provides allele frequency information for some SNPs but not for others. 

15 www.snp.chsl.org. The allele frequency for a particular SNP provides valuable information as to 
the utility of that SNP for the non-invasive prenatal screening method described in Example 5. 
While all SNPs can be used, SNPs with allele frequencies closer to 50:50 are preferable. 

Briefly, maternal blood contains fetal DNA. Maternal DNA can be distinguished from fetal 
DNA by examining SNPs wherein the mother is homozygous. For example, at SNP X, the 

20 maternal DNA may be homozygous for guanine. If template DNA obtained from the plasma of a 
pregnant female is heterozygous, as demonstrated by the detection of signals corresponding to an 
adenine allele and an guanine allele, the adenine allele can be used as a beacon for the fetal DNA 
(see Example 5). The closer the allele frequency of a SNP is to 50:50, the more likely there will be 
allele differences at a particular SNP between the maternal DNA and the fetal DNA. 

25 For example, if at SNP X the observed alleles are adenine and guanine, and the SNP has an 

allele frequency of 90(A): 10(G), it is likely that both mother and father will be homozygous for 
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adenine at that particular SNP. Thus, both the maternal DNA and the fetal DNA will be 
homozygous for adenine, and there is no distinct signal for the fetal DNA. However, if at SNP X 
the allele frequency is 50:50, and the mother is homozygous for adenine, the probability is higher 
that the paternal DNA will contain a guanine allele at SNP X. 

Below, a method for determining the allele frequency for a SNP is provided. Seven SNPs 
located on chromosome 13 were analyzed. The method is applicable for any SNP including but not 
limited to the SNPs on human chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21,22, X and Y. 

Preparation of Template DNA , 

To determine the allele frequency of a particular SNP, DNA was obtained from two 
hundred and fifty individuals after informed consent had been granted. From each individual, a 9 
ml blood sample was collected into a sterile tube (Fischer Scientific, 9 ml EDTA Vacuette tubes, 
catalog number NC9897284). The tubes were spun at 1000 rpm for ten minutes. The supernatant 
(the plasma) of each sample was removed, and one milliliter of the remaining blood sample, which 
is commonly referred to as the "bufly-coaf ' was transferred to a new tube. One milliliter of IX 
PBS was added to each sample. 

Template DNA was isolated using the QIAmp DNA Blood Midi Kit supplied by QIAGEN 
(Catalog number 5 1 183). The template DNA was isolated as per instructions included in the kit. 
From each individual, 0.76 \ig of DNA was pooled together, and the pooled DNA was used in all 
subsequent reactions. 

Design of Primers 

SNP TSC0903430 was amplified using the following primer set: 

First primer: 

5' GTCTTGCATGTAGAATTCTAGGGACGCTGCTTTTCGTC V 

Second primer: 
5' CTCCTAGACATCGGGACTAGAATGTCCAC 3' 
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The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
designed to anneal eighty-two bases from the locus of interest. The second primer contained the 
recognition site for the restriction enzyme BsmF I. 

5 SNP TSC0337961 was amplified using the following primer set: 

First primer: 

5' ACACAAGGCAGAGAATTCCAGTCCTGAGGGTGGGGGCC 3' 

10 

Second primer: 

5' CCGTGTTTTAACGGGACAAGCTGTTCTTC 3' 

1 5 The first primer contained a recognition site for the restriction enzyme EcoRI, and was 

designed to anneal ninety-two bases from the locus of interest. The second primer contained the 
recognition site for the restriction enzyme BsmF I. 

SNP TSC0786441 was amplified using the following primer set: 

20 

First primer: 

5' GTAGCGGAGGTTGAATTCTATATGTTGTCTTGGACATT 3' 

25 Second primer: 

5* CATCAGTAGAGTGGGACGAAAGTTCTGGC 3' 

The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
30 designed to anneal one hundred and four bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

SNP TSC1 168303 was amplified using the following primer set: , 

35 First primer: 
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5' ATCCACGCCGCAGAATTCGTATTCATGGGCATGTCAAA 3' 

Second primer: 

5' CTTGGGACTATTGGGACCAGTGTTCAATC 3' 

The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
designed to anneal sixty-four bases from the locus of interest. The second primer contained the 
recognition site for the restriction enzyme BsmF I. 

SNP TSC0056188 was amplified using the following primer set: 

First primer: 

5' CCAGAAAGCCGTGAATTCGTTAAGCCAACCTGACTCCA 3' 

Second primer: 

5' TCGGGGTTAGTCGGGACATCCAGCAGCCC 3' 

The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
designed to anneal eighty-two bases from the locus of interest. The second primer contained the 
recognition site for the restriction enzyme BsmF I. 

SNP TSC0466177 was amplified using the following primer set: 

t 

First primer: 

5' CGAAGGTAATGTGAATTCCAAAACTTAGTGCCACAATT 3' 

Second primer: 
5' ATACCGCCCAACGGGACAGATCCATTGAC 3' 
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The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
designed to anneal ninety-two bases from the locus of interest. The second primer contained the 
recognition site for the restriction enzyme BsmF I. 

SNP TSCO 197424 was amplified using the following primer set: 

First primer: 

5' AGAAACCTGTAAGAATTCGATTCCAAATTGTTTTTTGG 3* 

Second primer: 
5* CGATCATAGGGGGGGACAGGAGAGAGCAC 3' 

The first primer contained a recognition site for the restriction enzyme EcoRI, and was 
designed to anneal one hundred and four bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

The first primer was designed to anneal at various distances from the locus of interest. The 
skilled artisan understands that the annealing location of the first primer can be any distance from 
the locus of interest including but not limited to 5-10, 1 1-15, 16-20, 21-25, 26-30, 31-35, 36-40, 41- 
45, 46-50, 51-55, 56-60, 61-65, 66-70, 71-75, 76-80, 81-85, 86-90, 91-95, 96-100, 101-105, 106- 
110, 111-115, 116-120, 121-125, 126-130, 131-140, 141-160, 161-180, 181-200,201-220, 221-240, 
241-260, 261-280, 281-300, 301-350, 351-400, 401-450, 451-500, 501-100D, 1001-2000, 2001- 
3000, or greater than 3000. 

All loci of interest were amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
In this example, the loci of interest were amplified in separate reaction tubes but they can also be 
amplified together in a single PCR reaction. For increased specificity, a "hot-start" PCR was used. 
PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
number 203443). The amount of template DNA and primer per reaction can be optimized for each 
locus of interest. In this example, 40 ng of template human genomic DNA (a mixture of template 
DNA from 245 individuals) and 5 ^M of each primer were used. Forty cycles of PCR were 
performed. The following PCR conditions were used: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 37°C for 30 seconds; 
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(3) 95°C for 30 seconds; 

(4) 57°Cfor 30 seconds; 

(5) 95°C for 30 seconds; 

(6) 64°Cfor30 seconds; 
5 (7) 95°C for 30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature was about the melting temperature of 
the 3 ' annealing region of the second primers, which was 37°C. The annealing temperature in the 

1 0 second cycle of PCR was about the melting temperature of the 3 ! region, which anneals to the 

template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TNG in the first three cycles of PCR greatly improves specificity. 

15 These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. 

20 Purification of Fragment of Interest 

The PCR products were separated from the unused PCR reagents. After the PCR reaction, 
1/2 of the reaction volume for SNP TSC0903430, SNP TSC0337961, and SNP TSC0786441 were 
mixed together in a single reaction tube. One-half the reaction volumes for SNPs TSC1 168303, 
TSC0056188, TSC0466177, and TSC0197424 were pooled together in a single reaction tube. The 
25 un-used primers, and nucleotides were removed from the reaction by using Qiagen MinElute PCR 
purification kits (Qiagen, Catalog Number 28004). The reactions were performed following the 
manufacturer's instructions supplied with the columns. 

Restriction Enzyme Digestion of Isolated Fragments 
30 The purified PCR products were digested with the restriction enzyme BsmF I, which binds 

to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in eppendorf tubes following the instructions supplied with the restriction enzyme. 



179 



WO 2004/078994 



PCT/US2004/006337 



Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3' recessed end. The 5 1 overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
5 polymerase. 

As discussed in detail in Example 6, the sequence of both alleles of a SNP can be 
determined with one labeled nucleotide in the presence of the other unlabeled nucleotides. The 
following components were added to each fill in reaction: 1 |xl of fluorescently labeled ddGTP, 0.5 
\il of unlabeled ddNTPs ( 40 jiM), which contained all nucleotides except guanine, 2 |xl of 10X 

1 0 sequenase buffer, 0.25 of Sequenase, and water as needed for a 20|il reaction. The fill in 
reaction was performed at 40°C for 10 min. Sequenase was the DNA polymerase used in this 
example. However, any DNA polymerase can be used for a fill-in reaction including but not 
limited to E. coli DNA polymerase, Klenow fragment of E. coli DNA polymerase I, 17 DNA 
polymerase, T4 DNA polymerase, Taq polymerase, Pfu DNA polymerase, Vent DNA polymerase, 

15 polymerase from bacteriophage 29, and REDTaq™ Genomic DNA polymerase. Non-fluorescently 
labeled ddNTP was purchased from Fermentas Inc. (Hanover, MD). All other labeling reagents 
were obtained from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, 
US 79565). 



20 Detection of the Locus of Interest 

The sample was loaded into a lane of a 36 cm 5% acrylamide (urea) gel (BioWhittaker 
Molecular Applications, Long Ranger Run Gel Packs, catalog number 50691). The sample was 
electrophoresed into the gel at 3000 volts for 3 min. The gel was run for 3 hours on a sequencing 
apparatus (Hoefer SQ3 Sequencer). The gel was removed from the apparatus and scanned on the 
25 Typhoon 9400 Variable Mode Imager. The incorporated labeled nucleotide was detected by 
fluorescence. 

Below, a schematic of the 5' overhang for SNP TSC0056188 is reproduced (where R 
indicates the variable site). The entire sequence is not shown, only a portion of the overhang. 



30 5'CCA 

3'GGT R T C C 

Overhang position 12 3 4 



As discussed in detail in Example 6, one nucleotide labeled with one chemical moiety can 
35 be used to determine the sequence of the alleles of a locus of interest. The observed nucleotides for 
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TSC00561 88 on the 5' sense strand (here depicted as the top strand) are adenine and guanine. The 
third position in the overhang on the antisense strand is cytosine, which is complementary to 
guanine. As the variable site can be adenine or guanine, fluorescently labeled ddGTP in the 
presence of unlabeled dCTP, dTTP, and dATP was used to determine the sequence of both alleles. 
The fill-in reactions for an individual homozygous for guanine, homozygous for adenine or 
heterozygous are diagrammed below. 



Homozygous adenine: 

5'CCA A A G* 

3'GGT T T C C 

Overhang position 12 3 4 

Homozygous guanine: 

5'CCA G* 

3'GGT C T C C 

Overhang position 12 3 4 

Heterozygous: 

Allele 1 5'CCA G* 

3'GGT C T C C 

Overhang position 12 3 4 

Allele 2 5'CCA A A G* 

3'GGT T T C C 

Overhang position 12 3 4 



As seen in FIG. 14, two bands were detected for SNP TSC0056188. The lower band 
corresponded to DNA molecules filled in with ddGTP at position one complementary to the 
overhang, which is representative of the guanine allele. The higher band, separated by a single base 
from the lower band, corresponded to DNA molecules filled in with ddGTP at position 3 
complementary to the overhang. This band represented the adenine allele. The intensity of each 
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band was strong, indicating that each allele was well represented in the population. SNP 
TSC0056188 is representative of a SNP with high allele frequency. 

Below, a schematic of the 5' overhang generated after digestion with BsmF I for SNP 
TSC0337961 is reproduced (where R indicates the variable site). The entire sequence is not shown, 
5 only a portion of the overhang. 

5* GCCA 

3'CGGT R G C T 
Overhang position 12 3 4 

10 

The observed nucleotides for SNP TSC0337961 on the 5' sense strand (here depicted as the 
top strand) are adenine and guanine. The third position in the overhang on the antisense strand was 
cytosine, which is complementary to guanine. As the variable site can be adenine or guanine, 
fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP was used to 
15 determine the sequence of both alleles. The fill-in reactions for an individual homozygous for 
guanine, homozygous for adenine or heterozygous are diagrammed below. 

Homozygous for guanine: 

20 5' GCCA G* 

3'CGGT C G C T 
Overhang position 12 3 4 



25 



Homozygous for adenine: 



5' GCCA A C G* 

3'CGGT T G C T 

Overhang position 12 3 4 

30 Heterozygous 

Allele 1 5' GCCA G* 

3'CGGT C G C T 



Overhang position 12 3 4 



35 



182 



WO 2004/078994 



PCT/US2004/006337 



Allele 2 5* GCCA A C G* 

3' CGGT T G C T 
Overhang position 12 3 4 

5 As seen in FIG. 14, one band migrating at the position of the expected lower molecular 

weight band was observed. This band represented the DNA molecules filled in with ddGTP at 
position one complementary to the overhang, which represents the guanine allele. No band 
corresponding to the DNA molecules filled in with ddGTP at position 3 complementary to the 
overhang was detected. SNP TSC0337961 is representative of a SNP that is not highly variable 

1 0 within the population. 

Of the seven SNPs analyzed, four of the SNPs (TSC1 168303, TSC0056188, TSC0466177, 
and TSCO 197424 had high allele frequencies. Two bands of high intensity were seen for each of 
the four SNPs, indicating that both alleles were well represented in the population. 

However, it is not necessary that the SNPs have allele frequencies of 50:50 to be useful. 

1 5 All SNPs provide useful information. The methods described herein provide a rapid technique for 
determining the allele frequency of a SNP, or any variable site including but not limited to point 
mutations. Allele frequencies of 50:50, 51:49, 52:48, 53:47, 54:46, 55:45, 56:46, 57:43, 58:42, 
59:41, 60:40, 61:39, 62:38, 63:37, 64:36, 65:35, 66:34, 67:33, 68:32, 69:31, 70:30, 71:29, 72:28, 
73:27, 74:26, 75:25, 76:24, 77:23, 78:22, 79:21, 80:20, 81:19, 82:18, 83:17, 84:16, 85:15, 86:14, 

20 87:13, 88:12, 89:11, 90:10, 91:9, 92:8, 93:7, 94:6, 95:5, 96:4, 97:3, 98:2, 99:1 and 100:0 can be 
useful. 

Two bands were seen for SNP TSC0903430. One band, the lower molecular weight band 
represented the DNA molecules filled in with labeled ddGTP. A band of weaker intensity was seen 
for the molecules filled in with labeled ddGTP at position 3 complementary to the overhang, which 
25 represented the cytosine allele. SNP TSC0903430 represents a SNP with low allele frequency 
variation. In the population, the majority of individuals carry the guanine allele, but the cytosine 
allele is still present. 

One band of high intensity was seen for SNP TSC0337961 and SNP TSC0786441. The 
band detected for both SNP TSC0337961 and SNP TSC0786441 corresponded to the DNA 
30 molecules filled in with ddGTP at position 1 complementary to the overhang. No signal was 
detected from DNA molecules that would have been filled in at position 3 complementary to the 
overhang, which would have represented the second allele. SNP TSC0337961 and SNP 
TSC0786441 represent SNPs with little variability in the population. 

As demonstrated in FIG 14., the first primer used to amplify each locus of interest can be 
35 designed to anneal at various distances from the locus of interest This allows multiple SNPs to be 
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analyzed in the same reaction. By designing the first primer to anneal at specified distances from 
the loci of interest, any number of loci of interest can be analyzed in a single reaction including but 
not limited to 1-10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 101-110, 111- 
120, 121-130, 131-140, 141-150, 151-160, 161-170, 171-180, 181-190, 191-200,201-300,301-400, 
5 401-500, and greater than 500. 

As discussed in Example 6, some type lis restriction enzymes display alternate cutting 
patterns. For example, the type US restriction enzyme BsmF I typically cuts 10/14 from its binding 
site; however, the enzyme also can cut 1 1/15 from the binding she. To eliminate the effect of the 
alternate cut, the labeled nucleotide used for the fill-in reaction should be chosen such that it is not 
10 complementary to position 0 of the overhang generated by the 11/15 cut (discussed in detail in 
Example 6). For instance, if you label with ddGTP, the nucleotide preceding the variable site on 
the strand that is filled in should not be a guanine. 

The 1 1/15 overhang generated by BsmF I for SNP TSC0056188 is depicted below, with the 
variable site in bold-typeface: 

15 

1 1/15 Overhang for TSC0056188 



Allele 1 5'CC 

3'GG T C T C 

20 Overhang position 0 12 3 



Allele 2 5'CC 

3'GG T T T C 

Overhang position 0 12 3 

25 

After the fill-in reaction with labeled ddGTP, unlabeled dATP, dTTP, and dCTP, the 
following molecules were generated: 

11/15 Allele 1 5'CC A G* 

30 3'GG T C T C 

Overhang position 0 12 3 



11/15 Allele 2 5'CC A A A G* 

3'GG T T T C 

35 Overhang position 0 12 3 
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Two signals were seen; one band corresponded to molecules filled in with ddGTP at 
position one of the overhang, and the other band corresponded to the molecules filled in with 
ddGTP at position 3 complementary to the overhang. These are the same DNA molecules 
5 generated after the fill-in reaction of the 10/14 overhang. Thus, the two bands can be compared 
without any ambiguity from the alternate cut. This method of labeling with, a single nucleotide 
eliminates any errors generated from the alternate cutting properties of the enzymes. 

The methods described herein is applicable to determining the allele frequency of any SNP 
including but not limited to SNPs on human chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 
10 15, 16, 17, 18, 19, 20, 21, 22, X and Y. 

EXAMPLE 9 

Heterozygous SNPs, by definition, differ by one nucleotide. At a heterozygous SNP, allele 
15 1 and allele 2 may be present at a ratio of 1 : 1 . However, it is possible that DNA polymerases can 
incorporate one nucleotide at a faster rate than other nucleotides, and thus the observed ratio of a 
heterozygous SNP may differ from the theoretically expected 1:1 ratio. 

Below, methods are described that allow efficient and accurate quantitation for the expected 
ratio of allele 1 to allele 2 at a heterozygous SNP. 

20 

Preparation of Template DNA 

Template DNA was obtained from twenty-four individuals after inlbrnied consent had been 
granted. From each individual, a 9 ml blood sample was collected into a sterile tube (Fischer 
Scientific, 9 ml EDTA Vacuette tubes, catalog number NC9897284). The tubes were spun at 1000 
25 rpm for ten minutes without brake. The supernatant (the plasma) of each sample was removed, and 
one milliliter of the remaining blood sample, which is commonly referred to as the "buflfy-coaf 9 
was transferred to a new tube. One milliliter of IX PBS was added to each sample. 

Template DNA was isolated using the QIAmp DNA Blood Midi Kit supplied by QIAGEN 
(Catalog number 511 83). The template DNA was isolated as per instructions included in the kit. 

30 

Design of Primers 

SNP TSC0607185 was amplified using the following primer set: 
35 First primer: 
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5' ACTTGATTCCGTGAATTCGTTATCAATAAATCTTACAT 3' 

Second primer: 
5* CAAGTTGGATCCGGGACCCAGGGCTAACC 3' 

SNP TSC1 130902 was amplified using the following primer set: 
10 First primer: 

5* TCTAACCATTGCGAATTCAGGGCAAGGGGGGTGAGATC 3' 

Second primer: 
5* TGACTTGGATCCGGGACAACGACTCATCC 3' 



15 



The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI. The second primer contained the recognition site for the restriction enzyme BsmF 
20 I. The first primer was designed to anneal at various distances from the locus of interest. 

The first primer for SNP TSC0607185 was designed to anneal ninety bases from the locus 
of interest. The first primer for SNP TSC1 130902 was designed to anneal sixty bases from the 
locus of interest 

All loci of interest were amplified from the template genomic DNA using the polymerase 
25 chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
In this example, the loci of interest were amplified in separate reaction tubes but they could also be 
amplified together in a single PCR reaction. For increased specificity, a "hot-start" PCR was used. 
PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
number 203443). The amount of template DNA and primer per reaction can be optimized for each 
30 locus of interest but in this example, 40 ng of template human genomic DNA and 5 \M of each 

primer were used. Forty cycles of PCR were performed. The following PCR conditions were used: 

(1 ) 95°C for 15 minutes and 1 5 seconds; 

(2) 37°Cfor30 seconds; 

(3) 95°C for 30 seconds; 
35 (4) 57°C for 30 seconds; 
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(5) 95°C for 30 seconds; 

(6) 64°C for 30 seconds; 

(7) 95°Cfor30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 
5 (9) 72°C for 5 minutes. 

In the first cycle of PGR, the annealing temperature was about the melting temperature of 
the 3 ' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3' region, which anneals to the 
template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 

10 PCR was about the melting temperature of the entire sequence of the second primer, which was 
64 "C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

15 The temperatures and times for denaturing, annealing, and extension, can be optimized by 

trying various settings and using the parameters that yield the best results. 

Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. One half of the PCR 
20 reaction was transferred to a well of a Streptawell, transparent, High-Bind plate from Roche 

Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 
Biochemicals Catalog). The first primers contained a 5 1 biotin tag so the PCR products bound to the 
Streptavidin coated wells while the genomic template DNA did not. The streptavidin binding 
reaction was performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each 
25 well was aspirated to remove unbound material, and washed three times with IX PBS, with gentle 
mixing (Kandpal et al„ Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et'al., Biotechniques 
10:30-34 (1991); Green et al., Nucl Acids Res. 18:6163-6164 (1990)). 

Restriction Enzyme Digestion of Isolated Fragments 
30 The purified PCR products were digested with the restriction enryme BsmF I, which binds 

to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
After digestion, the wells were washed three times with PBS to remove the cleaved fragments. 
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Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3' recessed end. The 5' overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

As discussed in detail in Example 6, the sequence of both alleles of a SNP can be 
determined by using one labeled nucleotide in the presence of the other unlabeled nucleotides. The 
following components were added to each fill in reaction: 1 fxl of fluorescently labeled ddGTP, 0.5 
\d of unlabeled ddNTPs ( 40 nM), which contained all nucleotides except guanine, 2 \xl of 10X 
sequenase buffer, 0.25 jil of Sequenase, and water as needed for a 20jil reaction. The fill in 
reaction was performed at 40°C for 10 min. Non-fluorescently labeled ddNTP was purchased from 
Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained from Amersham 
(Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 jil) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 

Detection of the Locus of Interest 

The samples were loaded into a lane of a 36 cm 5% acrylamide (urea) gel (BioWhittaker 
Molecular Applications, Long Ranger Run Gel Packs, catalog number 50691). The samples were 
electrophoresed into the gel at 3000 volts for 3 min. The gel was run for 3 hours on a sequencing 
apparatus (Hoefer SQ3 Sequencer). The gel was removed from the apparatus and scanned on the 
Typhoon 9400 Variable Mode Imager. The incorporated labeled nucleotide was detected by 
fluorescence. A box was drawn around each band and the intensity of the band was calculated 
using the Typhoon 9400 Variable Mode Imager software. 

Below, a schematic of the 5' overhang for SNP TSC0607185 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the ove rfiang (where R indicates the 
variable site). 

C C T R TGTC3' 

ACAG 5' 

4 3 2 1 Overhang position 



188 



WO 2004/078994 



PCT/US2004/006337 



The observed nucleotides at the variable site for TSC06071 85 on the 5 * sense strand (here 
depicted as the top strand) are cytosine and thymidine (depicted here as R). In this case, the second 
primer anneals from the locus of interest, which allows the fill-in reaction to occur on the anti-sense 
strand (depicted here as the bottom strand). The antisense strand will be filled in with guanine or 
adenine. 

The second position in the 5* overhang is thymidine, which is complementary to adenine, 
and the third position in the overhang corresponds to cytosine, which is complementary to guanine. 
Fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP was used to 
determine the sequence of both alleles. After the fill-in reaction, the following DNA molecules 
were generated: 



C C T C 
G* 

4 3 2 1 

C C T T 
G* A A 
4 3 2 1 



TGTC 3' Allele 1 
ACAG 5' 
Overhang position 

TGTC 3' Allele 1 
ACAG 5' 
Overhang position 



The overhang generated by BsmF I cutting at 1 1/15 from the recognition site at 
TSC0607185 is depicted below: 

C T R T GTC3M1/15 

CAG 5' 

3 2 10 Overhang position 

As labeled ddGTP is used for the fill-in reaction, no new signal will be generated from the 
molecules cut 1 1/15 from the recognition site. Position 0 complementary to the overhang was filled 
in with unlabeled dATP. Only signals generated from molecules filled in with labeled ddGTP at 
position 1 complementary to the overhang or molecules filled in with labeled ddGTP at position 3 
complementary to the overhang were seen. 

Five of the twenty-four individuals were heterozygous for SNP TSC0607185. As shown in 
FIG. 15, two bands were detected. The lower molecular weight band corresponded to DNA 
molecules filled in with ddGTP at position 1 complementary to the overhang. The higher molecular 
weight band corresponded to DNA molecules filled in with ddGTP at position 3 complementary to 
the overhang. 
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The ratio of the two alleles was calculated for each of the five heterozygous samples (see 
Table XVI). The average ratio of allele 2 to allele 1 was 1.000 with a standard deviation of 0.044. 
Thus, the allele ratio at SNP TSC0607185 was highly consistent. The experimentally calculated 
allele ratio for a particular SNP is hereinafter referred to as the "p" value of the SNP. Analysis of 
5 SNP TSC0607185 consistently will provide an allele ratio of 1:1, provided that the number of 
genomes analyzed is of sufficient quantity that no error is generated from statistical sampling. 

If the sample contained a low number of genomes, it is statistically possible that the primers 
will anneal to one chromosome over another chromosome. For example, if the sample contains 40 
genomes, which corresponds to a total of 40 chromosomes of allele 1 and 40 chromosomes of allele 
10 2, the primers may anneal to 40 chromosomes of allele 1 but only 35 chromosome of allele 2. This 
would cause allele 1 to be amplified preferentially to allele 2, which would alter the ratio of allele 1 
to allele 2. This problem is eliminated by having a sufficient number of genomes in the sample. 

SNP TSC0607185 represents a SNP where the difference in the nucleotide at the variable 
site does not affect the PCR reaction, or digestion with the restriction enzyme or the fill-in reaction. 
15 The use of one nucleotide labeled with one fluorescent dye assures that the bands for one allele can 
be accurately compared to the bands for the second allele. There is no added complication of 
having to compare between two different lanes, or having to correct for the quantum coefficients of 
the dyes. Additionally, any effect from the alternate cutting properties of the type IIS restriction 
enzymes has been removed. 

20 

TABLE XVI. Ratio of allele 2 to allele 1 at SNPs TSC0607185 and TSC1 130902. 



SNP TSC0607185 SNP TSC1 130902 



Sample 


Allele 1 


Allele 2 


Allele2/Allele 1 


Allele 1 


Allele 2 


Allele2/Allele 1 


1 


2382 


2313 


0.971033 


5877 


4433 ' 


0.754296 


2 


1581 


1533 


0.969639 


3652 


2695 


0.737952 


3 


1795 


1879 


1.046797 


5416 


3964 


0.730059 


4 


1921 


1855 


0.965643 


3493 


2663 


0.762382 


5 


1618 


1701 


1.051298 


3894 


2808 


0.721109 
















Average 






1.000882 






0.74116 


STD 






0.044042 






0.017018 
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Below, a schematic of the 5' overhang for SNP TSC1 130902 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (\yhere R indicates the 
variable site). 

5 5' TTCAT 

3'AAGTA R T C C 
Overhang position 12 3 4 



The observed nucleotides for TSC1 130902 on the 5' sense strand (here depicted as the top 
10 strand) are adenine and guanine. The second position in the overhang corresponds to a thymidine, 
and the third position in the overhang corresponds to cytosine, which is complementary to guanine. 
Fluorescently labeled ddGTP in the presence of unlabeled dCTP, dTTP, and dATP was used to 
determine the sequence of both alleles. After the fill-in reaction, the following DNA molecules 



15 



were generated: 



Allele 1 5' TTCAT G* 

3'AAGTA C T C C 
Overhang position 12 3 4 



20 Allele 2 5' TTCAT A A G* 

3'AAGTA T T C C 
Overhang position 12 3 4 



As shown in FIG. 15, two bands were detected. The lower molecular weight band 
25 corresponded to DNA molecules filled in with labeled ddGTP at position 1 complementary to the 
overhang (the G allele). The higher molecular weight band, separated by a single base from the 
lower band, corresponded to DNA molecules filled in with ddGTP at position 3 complementary to 
the overhang (the A allele). 

Five of the twenty-four individuals were hetero2ygous for SNP TSC1 130902. As seen in 
30 FIG. 15, the band corresponding to allele 1 was more intense than the band corresponding to allele 
2. This was seen for each of the five individuals. The actual intensity of the band corresponding to 
allele 1 varied from individual to individual but it was always more intense than the band 
corresponding to allele 2. For the five individuals, the average ratio of allele 2 to allele 1 was 
0.741 16, with a standard deviation of 0.017018. 
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Template DNA was prepared from five different individuals. Separate PCR reactions, 
separate restriction enzyme digestions, and separate fill-in reactions were performed. However, for 
each template DNA, the ratio of allele 2 to allele 1 was about 0.75. The "p" value for this SNP was 
highly consistent. 

5 For example, for SNP TSOl 130902, the "p" valiie was 0.75. Any deviation from this 

value, provided the sample contains an adequate number of genomes to remove statistical sampling 
errors, will indicate that there is an abnormal copy number of chromosome 13. If there is an 
additional copy of allele 2, the "p" value will be higher than the expected 0.75. However, if there is 
an addition copy of allele 1, the "p" value will be lower than the expected 0.75. With the "p" value 

10 quantitated for a particular SNP, that SNP can be used to determine the presence or absence of a 
chromosomal abnormality. An accurate "p" value measured for a single SNP will be sufficient to 
detect the presence of a chromosomal abnormality. 

There are several possible explanations for why the ratio of one allele to the other allele at 
some SNPs varies from the theoretically expected ratio of 1:1. First, it is possible that the DNA 

15 polymerase incorporates one nucleotide faster than the other nucleotide. As the alleles are being 
amplified by PCR, even a slight preference for one nucleotide over the other may cause variation 
from the expected 1:1 ratio. This potential preference for one nucleotide over the other is not seen 
during the fill-in reaction because a single nucleotide labeled with one dye is used. 

It is also possible that the variable nucleotide at the SNP site influences the rate of 

20 denaturation of the two alleles. If allele 1 contains a guanine and allele 2 contains an adenine, the 
difference between the strength of the bonds for these nucleotides may affect the rate at which the 
DNA strands separate. Again, it is important to mention that the alleles are being amplified by PCR 
so very subtle differences can make a large impact on the final result. It is also possible that the 
variable nucleotide at the SNP site influences the rate at which the two strands anneal after 

25 separation. 

Alternatively, it is possible that the type IIS restriction enzyme cuts one allele 
preferentially to the other allele. As discussed in detail above, type IIS restriction enzymes cut at a 
distance from the recognition site. It is possible that the variable nucleotide at the SNP site 
influences the efficiency of the restriction enzyme digestion. It is possible that at some SNPs the 
30 restriction enzyme cuts one allele with an efficiency of 100%, while it cuts the other allele with an 
efficiency of 90%. 

However, the fact that the ratio of allele 1 to allele 2 deviates from the theoretically 
expected ratio of 1:1, does not influence or reduce the utility of that SNP. As demonstrated above, 
the "p" value for each SNP is consistent among different individuals. 
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The "p" value for any SNP can be calculated by analyzing the template DNA of any 
number of heterozygous individuals including but not limited to 1-10, 11-20, 21-30, 31-40, 41-50, 
51-60, 61-70, 71-80, 81-90, 91-100, 101-110, 111-120, 121-130, 131-140, 141-150, 151-160, 161- 
170, 171-180, 181-190, 191-200, 201-210, 21 1-220, 221-230, 231-240, 241-250, 251-260, 261-270, 
5 271-280, 281-290, 291-300, and greater than 300. 

The methods described herein allow the "p" value for any SNP to be determined. It is 
possible that some SNPs will behave more consistently than other SNPs. In the human genome, 
there are over 3 million SNPs; it is not possible to speculate on how each SNP will behave. The 
"p" value for each SNP will have to be experimentally determined. The methods described herein 
10 allow identification of SNPs that have highly consistent, and reproducible "p" values. 

EXAMPLE 10 

As discussed in Example 9, the ratio of one allele to the other allele at a particular SNP may 
vary from the theoretically expected ratio of 50:50. These SNPs can be used to detect the presence 
15 of additional chromosomes provided that the ratio of one allele to the other allele remains linear in 
individuals with chromosomal disorders. For example, at SNP X if the percentage of allele 1 to 
allele 2 is 75:25, the expected percentage of allele 1 to allele 2 for an individual with Down's 
syndrome must be properly adjusted to reflect the variation from the expected percentage at this 
SNP. 

20 The percentage of allele 1 to allele 2 for SNP TSC0108992 on chromosome 21 was 

calculated using template DNA from four normal individuals and template DNA from an individual 
with Down's syndrome. As demonstrated below, the percentage of one allele to the other allele was 
consistent and remained linear in an individual with Down's syndrome. 

25 Preparation of Template DNA 

DNA was obtained from four individuals with a normal genetic karyotype and an individual 
identified as having an extra copy of chromosome 21 (Down's syndrome). Informed consent was 
obtained from all individuals. Informed consent also was obtained from the parents of the 
30 individual with Down's syndrome. 

From each individual, a 9 ml blood sample was collected into a sterile tube (Fischer 
Scientific, 9 ml EDTA Vacuette tubes, catalog number NC9897284). Template DNA was isolated 
using the QIAmp DNA Blood Midi Kit supplied by QIAGEN (Catalog number 51183). The 
template DNA was isolated as per instructions included in the kit. 

35 
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Design of Primers 

SNP TSC0108992 was amplified using the following primer set: 
5 First primer: 

5* CTACTGAGGGCTCGTAGATCCCAATTCCTTCCCAAGCT 3* 
Second primer: 

10 

5' AATCCTGCTTTAGGGACCATGCTGGTGGA 3' 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI. The second primer contained the recognition site for the restriction enzyme BsmF 
15 I. 

SNP TSC01 08992 was amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 

For increased specificity, a "hot-start" PCR was used. PCR reactions were performed using 
the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount of 
20 template DNA and primer per reaction can be optimized for each locus of interest. In this example, 
50 ng of template human genomic DNA and 5 \xM of each primer were used. Thirty-eight cycles of 
PCR were performed. The following PCR conditions were used: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 37°C for 30 seconds; 
25 (3) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°Cfor30 seconds; 

(6) 64°Cfor30 seconds; 

(7) 95°C for 30 seconds; 

30 (8) Repeat steps 6 and 7 thirty-seven (37) times; 

(9) 72°C for 5 minutes. 
In the first cycle of PCR, the annealing temperature was about the melting temperature of 
the 3 ' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3' region, which anneals to the 
35 template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
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PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
5 annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. » 

Purification of Fragment of Interest 

1 0 The PCR products were separated from the genomic template DNA. Each PCR reaction 

was split into two samples and transferred to two separate wells of a Streptawell, transparent, High- 
Bind plate from Roche Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular 
Biochemicals, 2001 Biochemicals Catalog). For each PCR reaction, there were two replicates; each 
in a separate well of a microtiter plate. The first primer contained a 5* biotin tag so the PCR 

15 products bound to the Streptavidin coated wells while the genomic template DNA did not. The 
streptavidin binding reaction was performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 
min. at 37°C. Each well was aspirated to remove unbound material, and washed three times with 
IX PBS, with gentle mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., 
Biotechniques 10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 

20 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
25 After digestion, the wells were washed three times with IX PBS to remove the cleaved fragments. 

Incorporation of Labeled Nucleotide 

The restriction enryme digest with BsmF I yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3 f recessed end. The 5* overhang functioned 
30 as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

As discussed in detail in Example 6, the sequence of both alleles of a SNP can be 
determined with one labeled nucleotide in the presence of the other unlabeled nucleotides. The 
following components were added to each fill in reaction: 1 ill of fluorescently labeled ddTTP, 0.5 
35 jil of unlabeled ddNTPs ( 40 fiM), which contained all nucleotides except thymidine, 2 \il of 10X 
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sequenase buffer, 0.25 \il of Sequenase, and water as needed for a 20jil reaction. The fill in 
reaction was performed at 40°C for 10 min. Non-fluorescently labeled ddNTP was purchased from 
Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained from Amersham 
(Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 nl) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 

Detection of the Locus of Interest 

The samples were loaded into the lanes of a 36 cm 5% acrylamide (urea) gel (BioWhittaker 
Molecular Applications, Long Ranger Run Gel Packs, catalog number 50691). The samples were 
electrophoresed into the gel at 3000 volts for 3 min. The gel was run for 3 hours on a sequencing 
apparatus (Hoefer SQ3 Sequencer). The gel was removed from the apparatus and scanned on the 
Typhoon 9400 Variable Mode Imager. The incorporated labeled nucleotide was detected by 
fluorescence. A box was drawn around each band and the intensity of the band was calculated 
using the Typhoon 9400 Variable Mode Imager software. 

Below, a schematic of the 5' overhang for SNP TSC0108992 is shown. The entire DNA 
sequence is not reproduced, only the portion to demonstrate the overhang (where R indicates the 
variable site). 

GTCC3' 

G A C R CAGG5' 
4 3 2 1 Overhang Position 
The observed nucleotides for SNP TSC0108992 are adenine and thymidine on the sense 
strand (here depicted as the top strand). Position 3 of the overhang corresponds to adenine, which 
is complementary to thymidine. Labeled ddTTP was used in the presence of unlabeled dATP, 
dCTP, and dGTP. After the fill-in reaction with labeled ddTTP, the following DNA molecules 
were generated: 

T* G A GTCC3' Allele 1 
G A C T CAGG5' 
4 3 2 1 Overhang Position 
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T* GTCC3' Allele 2 
G A C A CAGG5' 
4 3 2 1 Overhang Position 

5 There was no difficulty in comparing the values obtained from allele 1 to allele 2 because 

one labeled nucleotide was used for the fill-in reaction, and the fill-in reaction for both alleles 
occurred in a single tube. The alternate cutting properties of BsmF I would not influence this 
analysis because the 1 1/15 overhang would be filled in just as the 10/14 overhang. Schematics of 
the filled-in 1 1/15 overhangs are depicted below: 

10 



T* 
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TCC3' 11/15 Allele 1 


A 


C 


T 


C 


AGG 5' 
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AGG 5' 
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1 


0 


Overhang Position 



As seen in FIG. 16, two bands were seen for each sample of template DNA. The lower 
20 molecular weight band corresponded to the DNA molecules filled in with ddTTP at position one 
complementary to the overhang, and the higher molecular weight band corresponded to DNA 
molecules filled in with ddTTP at position 3 complementary to the overhang. 

The percentage of allele 2 to allele 1 was highly consistent, (see Table XVII). In addition, 
for any given individual, the replicates of the PCR reaction showed similar results (see Table 
25 XVH). The percentage of allele 2 to allele 1 was calculated by dividing the value of allele 2 by the 
sum of the values for allele 1 and allele 2 (allele 2/(allele 1+ allele 2)). From four individuals, the 
average percentage of allele 2 to allele 1 was 0.4773 with a standard deviation of 0.0097. The 
percentage of allele 2 to allele 1 on template DNA isolated from an individual with Down's 
syndrome was 0.3086. 

30 The theoretically expected percentage of allele 2 to allele 1 using template DNA from a 

normal individual is 0.50. However, the experimentally determined percentage was 0.4773. The 
theoretically expected percentage of allele 2 to allele 1 for an individual with an extra copy of 
chromosome 21 is 0.33. The experimentally determined percentage of allele 2 to allele 1 for SNP 
TSC0108992 was 0.3086. 
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The deviation from the theoretically expected percentage is highly consistent and remains 
linear. The following formula demonstrates that the percentage of allele 2 to allele 1 at SNP 
TSCO 108992 remains linear even on template DNA obtained from an individual with an extra copy 
of chromosome 21: 

5 

0.47 X 



0.50 0.33 

10 X = 0.3 102 

If the percentage of allele 2 to allele 1 using template DNA obtained from a normal 
individual is determined to be 0.47, then the percentage of allele 2 to allele 1 using template DNA 
from an individual with Down's syndrome should be 0.3102. The experimentally determined ratio 

15 was 0.3086, with a standard deviation of 0.00186. There is no difference between the predicted 
percentage and the experimentally determined percentage of allele 2 to allele 1 on template DNA 
from an individual with Down's syndrome. 

The percentage of one allele to the other allele at a particular SNP is highly consistent, 
reproducible, and linear. This demonstrates that any SNP, regardless of tjie calculated percentage 

20 for one allele to another, can be used to determine the presence or absence of a chromosomal 
disorder. 

TABLE XVH. PercentageofAllele2toAllelelatSNPTSC0108992. 



Sample 


Allele 2 


Allele 1 


2/(2+1) 


1A 


9568886 


10578972 


0.474933 


1B 


8330864 


9221381 


0.474632 


2A 


9801053 


10345444 


0.486489 


2B 


8970942 


9603102 


0.482983 


3A 


8676718 


9211085 


0.485063 


3B 


10847024 


11420943 


0.487113 


4A 


10512420 


12227107 


0.462297 


4B 


7883584 


9055289 


0.465414 














MEAN 


0.477366 






STDEV 


0.009654 
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OS 


6797400 


15138959 


0.309869 


DS 


6025753 


13586890 


0.307238 














MEAN 


0.308554 






STDEV 


0.00186 



EXAMPLE 11 

5 

The percentage of allele 2 to allele 1 for a particular SNP is highly consistent. Statistically 
significant deviation from the experimentally determined ratio indicates the presence of a 
chromosomal abnormality. Below, the percentage of allele 2 to allele 1 at SNP TSC0108992 on 
chromosome 21 was calculated using template DNA from a normal individual and template DNA 
10 from an individual with Down's syndrome. Mixtures containing various amounts of normal DNA 
and Down's syndrome DNA were prepared and analyzed in a blind fashion. 

Preparation of Template DNA 

15 DNA was obtained from an individual with a normal genetic karyotype and an individual 

identified as having an extra copy of chromosome 21 (Down's syndrome). Informed consent was 
obtained from both individuals. Informed consent also was obtained from the parents of the 
individual with Down's syndrome. 

From each individual, a 9 ml blood sample was collected into a sterile tube (Fischer 

20 Scientific, 9 ml EDTA Vacuette tubes, catalog number NC9897284). Template DNA was isolated 
using the QIAmp DNA Blood Midi Kit supplied by QIAGEN (Catalog number 51183). The 
template DNA was isolated as per instructions included in the kit 

Mixtures of Template DNA 

The template DNA from the individual with the normal karyotype and the template DNA 
25 from the individual with an extra copy of chromosome 21 were diluted to a concentration of 10 
ng/jil. Four mixtures of normal template DNA and Down's syndrome template DNA were made in 
the following fashion: 
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Mixture 3 



Mixture 1 



Mixture 2; 



Mixture 4: 



32 jil of Normal DNA + 8 |il of Down's syndrome DNA 
28 jil of Normal DNA + 12 ^1 of Down's syndrome DNA 
20 |il of Normal DNA + 20 |al of Down's syndrome DNA 
10 |il of Normal DNA + 30 jil of Down's syndrome DNA 



Three separate PCR reactions were set up for the normal template DNA and the template 
DNA from the individual with Down's syndrome. Likewise, for each mixture, three separate PCR 
reactions were set up. 

Design of Primers 

SNP TSC0108992 was amplified using the following primer set: 
First primer: 

5' CTACTGAGGGCTCGTAGATCCCAATTCCTTCCCAAGCT 3' 
Second primer: 

5' AATCCTGCTTTAGGGACCATGCTGGTGGA 3' 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRL The second primer contained the recognition site for the restriction enzyme BsmF 
I. 

SNP TSC01 08992 was amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 

For increased specificity, a "hot-start" PCR was used. PCR reactions were performed using 
the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount of 
template DNA and primer per reaction can be optimized for each locus of interest but in this 
example, 50 ng of template human genomic DNA and 5 (iM of each primer were used. Thirty-eight 
cycles of PCR were performed. The following PCR conditions were used: 



(1) 95°C for 15 minutes and 15 seconds; 



(2) 37°C for 30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 
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(5) 95°Cfor30 seconds; 

(6) 64°Cfor30 seconds; 

(7) 95°Cfor30 seconds; 

(8) Repeat steps 6 and 7 thirty-seven (37) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PGR, the annealing temperature was about the melting temperature of 
the 3' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3* region, which anneals to the 
template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64*C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. 

Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. Each PCR reaction 
was split into two samples and transferred to two separate wells of a Streptawell, transparent, High- 
Bind plate from Roche Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular 
Biochemicals, 2001 Biochemicals Catalog). For each PCR reaction, there were two replicates, each 
in a separate well of a microtiter plate. The first primer contained a 5' biotin tag so the PCR 
products bound to the Streptavidin coated wells while the genomic template DNA did not. The 
streptavidin binding reaction was performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 
min. at 37°C. Each well was aspirated to remove unbound material, and washed three times with 
IX PBS, with gentle mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., 
Biotechniques 10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 

Restriction En2yme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
After digestion, the wells were washed three times with IX PBS to remove the cleaved fragments. 
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Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3' recessed end. The 5 1 overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

As discussed in detail in Example 6, the sequence of both alleles of a SNP can be 
determined with one labeled nucleotide in the presence of the other unlabeled nucleotides. The 
following components were added to each fill in reaction: 1 jil of fluorescently labeled ddTTP, 0.5 
\il of unlabeled ddNTPs ( 40 fiM), which contained all nucleotides except thymidine, 2 ^1 of 10X 
sequenase buffer, 0.25 \il of Sequenase, and water as needed for a 20^1 reaction. The fill in 
reaction was performed at 40°C for 10 min. Non-fluorescently labeled ddNTP was purchased from 
Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained from Amersham 
(Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 ^1) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturers instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 

Detection of the Locus of Interest 

The samples were loaded into the lanes of a 36 cm 5% acrylamide (urea) gel (BioWhittaker 
Molecular Applications, Long Ranger Run Gel Packs, catalog number 50691). The samples were 
electrophoresed into the gel at 3000 volts for 3 min. The gel was run for 3 hours on a sequencing 
apparatus (Hoefer SQ3 Sequencer). The gel was removed from the apparatus and scanned on the 
Typhoon 9400 Variable Mode Imager. The incorporated labeled nucleotide was detected by 
fluorescence. A box was drawn around each band and the intensity of the band was calculated 
using the Typhoon 9400 Variable Mode Imager software. 

As seen in FIGS. 17 A-F, two bands were seen. The lower molecular weight band 
corresponded to the DNA molecules filled in with ddTTP at position one complementary to the 
overhang. The higher molecular weight band corresponded to DNA molecules filled in with ddTTP 
at position 3 complementaiy to the overhang. 

The experiment was performed in a blind fashion. The tubes were coded so that it was not 
known what tube corresponded to what template DNA. After the gels were analyzed, each tube 
was grouped into the following categories: normal template DNA, Down's syndrome template 



202 



WO 2004/078994 



PCT/US2004/006337 



DNA, 3:1 mixture of Down's syndrome template DNA to normal DNA, 1:1 mixture of normal 
template DNA to Down's syndrome template DNA, 1:2.3 mixture of Down's syndrome template 
DNA to normal template DNA, and 1:4 mixture of Down's syndrome template DNA to normal 
template DNA. Each replicate of each PCR reaction successfully was grouped into the appropriate 
5 category, which demonstrates that the method can be used to detect abnormal DNA even if it 
represents only a small percentage of the total DNA. 

The percentage of allele 2 to allele 1 for each replicate of the three PCR reactions from 
normal template DNA are displayed in Table XVm (also see FIG. 17A). The average percentage 
of allele 2 to allele 1 was calculated by dividing the value of allele 2 by the sum of the values for 

10 allele 1 and allele 2 (allele 2 / (allele 1 + allele 2)), which resulted in an average of 0.50025 with a 
standard deviation of 0.002897. Thus, allele 1 and allele 2 were present in a ratio of 50:50. While 
the intensity of the bands varied from one PCR reaction to another (compare reaction 1 with 
reaction 3), there was no difference in intensity within a PCR reaction. Furthermore, the values 
obtained for the two replicates of the PCR reactions were very similar. Most of the variation was 

15 between PCR reactions and was likely attributable to pipetting errors. 

The percentage of allele 2 to allele 1 for each replicate of the three PCR reactions from 
Down's syndrome template DNA are displayed in Table XVIII (see FIG. ,17B). The percentage of 
allele 2 to allele 1 was calculated by dividing the value of allele 2 by the sum of the values for allele 
1 and allele 2 (allele 2/allele 1+ allele 2), which resulted in an average of 0.301314 with a standard 

20 deviation of 0.012917. It is clear even upon analysis of the gel by the naked eye that allele 1 is 
present in a higher copy number than allele 2 (see FIG. 17B). Again, most of the variation occurs 
between PCR reactions and not within the replicate of a PCR reaction. The majority of the 
statistical variation likely resulted from pipetting errors. 

Analysis of a single SNP was sufficient to detect the presence of the chromosomal 

25 abnormality. One SNP is sufficient provided that the "p" value of the SNP is known and that there 
are an adequate number of genomes so that statistical sampling error is not introduced into the 
analysis. In this experiment, there were approximately 5,000 genomes in each reaction. 

The reactions that consisted of a mixture of Down's syndrome template DNA to normal 
template DNA at a ratio of 3 : 1 were clearly distinguishable from the normal template DNA, and the 

30 other mixtures of DNA (see FIG. 17C). The calculated percentage of allele 2 to allele 1 was 
0.319089 with a standard deviation of 0.004346 (see Table XVHI). Likewise, the reactions that 
consisted of a mixture of Down's syndrome template DNA to normal template DNA at ratios of 
1:1, and 1:2.3 were distinguishable (see FIG. 17D and 17E) and the values were statistically 
significant from all other reactions (see Table XVIff). 
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As the amount of normal template DNA increased, the percentage of allele 2 to allele 1 
increased. With a mixture of Down's syndrome template DNA to normal template DNA of 1:4, the 
percentage of allele 2 to allele 1 was 0.397642, with a standard deviation of 0.001903 (see FIG 
17F). The difference between this value and the value obtained from normal template DNA is 
5 statistically significant. Thus, the methods described herein allow the detection of a chromosomal 
abnormality even when the sample is not a homogeneous sample of abnormal DNA. ■ 

As described above, the presence of a small fraction of DNA. with an abnormal copy 
number of chromosomes can be detected even among a large presence of normal DNA. It was 
clear, even by the naked eye, that as the amount of normal DNA increased and the amount of 
10 Down's syndrome DNA decreased, the intensities of the bands that corresponded to alleles 1 and 2 
equalized. 

The above example analyzed a SNP located on chromosome 21. However, any SNP may 
be analyzed on any chromosome including but not limited to human chromosomes 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, and Y and fetal chromosomes 1, 2, 3, 4, 5, 
15 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, and Y. In addition, chromosomes 
from non-human organisms can be analyzed using the above methods. Any combination of 
chromosomes can be analyzed. In the above example, an extra copy of a chromosome was 
detected. However, the same methods can be used to detect monosomies. 

20 TABLE XVffl. Percentage of allele 2 to allele 1 at SNP TSC0108992 using normal template DNA 
and Down's syndrome template DNA. 







Normal Template 
DNA 






Allele 1 


Allele 2 


2/(2+1) 


1A 


2602115 


2604525 


0.500231 


1B 


2855846 


2923860 


0.505884 


2A 


1954765 


1941929 


0.498353 


2B 


2084476 


2068106 


0.498029 


3A 


2044147 


2035719 


0.498967 


3B 


1760291 


1760543 


0.500036 














Mean 


0.50025 






STD 


0.002897 














Down's Syndrome 
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EXAMPLE 12 



As discussed above in Example 9, the ratio for allele 1 to allele 2 at a heterozygous SNP is 
5 constant. However, one factor that can influence the ratio of allele 1 to allele 2 at a heterozygous 
SNP is a low number of genomes. For example, if there are 40 genomes, which means that there 
are a total of 40 chromosomes of allele 1 and 40 chromosomes of allele 2,' it is statistically possible 
that the primers may anneal to 40 of the chromosomes with allele 1 but only 30 of the chromosomes 
with allele 2. This will affect the ratio of allele 1 to allele 2, and can erroneously influence the "p" 
1 0 value for a particular SNP. 

Typically, whole genomic amplification, which employs degenerate oligonucleotide PCR, 
is used to increase low quantities of genomic DNA samples. Oligonucleotides of 8, 10, 12, or 14 
bases are used to amplify the genome. It is thought that the primers anneal randomly throughout 
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the genome, and will amplify a small genomic DNA sample into hundreds-fold more DNA for 
genetic analysis. 

The methods described herein exploit the fact that typically the whole genome is not of 
interest. Particular loci of interest located on one chromosome, or on multiple chromosomes or on 
5 chromosomes that represent the entire genome are selected for analysis. Even if the loci of interest 
are located on chromosomes for the entire genome, it is preferential to amplify the region of those 
chromosomes that contain the loci of interest. 

To overcome the limit of a low number of genomes, which is often seen with fetal DNA 
obtained from the plasma of a pregnant female, a multiplex method can be used to increase the 
10 number of genomes. The method described below preferentially amplifies the chromosome or 
chromosomes that contain the loci of interest. 

Preparation of Template DNA 

15 A 9 ml blood sample was collected into a sterile tube from a human volunteer after 

informed consent had been granted. (Fischer Scientific, 9 ml EDTA Vacuette tubes, catalog number 
NC9897284). The tubes were spun at 1000 ipm for ten minutes. The supernatant (the plasma) of 
each sample was removed, and one milliliter of the remaining blood sample, which is commonly 
referred to as the "buffy-coat" was transferred to a new tube. One milliliter of IX PBS was added 

20 to each sample. Template DNA was isolated using the QIAmp DNA Blood Midi Kit supplied by 
QIAGEN (Catalog number 5 1 1 83). 

Design of Multiplex Primers 

25 Primers were designed to anneal at various regions on chromosome 21 to increase the copy 

number of the loci of interest located on chromosome 21. The primers were 12 bases in length. 
However, primers of any length can be used including but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 
46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 116-125, and greater than 125 bases. Primers 

3 0 were designed to anneal to both the sense strand and the antisense strand. 

Nine SNPs located on chromosome 21 were analyzed: TSC0397235, TSC0470003, 
TSC1649726, TSC1261039, TSC0310507, TSC1650432, TSC1335008, TSC0128307, and 
TSC0259757. Any number of SNPs can be analyzed including but not limited to 1-10, 1 1-20, 21- 
30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 101-200, 201-300, 301-400, 401-500, 501- 
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600, 601-700, 701-800, 801-900, 901-1000, 1001-2000, 2001-3000, 3001-4000, 4001-5000, 5001- 
6000, 6001-7000, 7001-8000, 8001-9000, 9001-10,000 and greater than 10,000. 

For each of the 9 SNPs, a 12 base primer was designed to anneal approximately 130 bases 
upstream of the loci of interest, and a 12 base primer was designed to anneal approximately 130 
5 bases downstream of the loci of interest (herein referred to as the multiplex primers). The multiplex 
primers can be designed to anneal at any distance from the loci of interest including but not limited 
to 10-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 101-110, 111-120, 121-130, 
131-140, 141-150, 151-160, 161-170, 171-180, 181-190, 191-200, 201-210, 211-220, 221-230, 
231-240, 241-250, 251-260, 261-270, 271-280, 281-290, 291-300, 301-310, 311-320, 321-330, 

10 331-340, 341-350, 351-360, 361-370, 371-380, 381-390, 391-400, 401-410, 411-420, 421-430, 
431-440, 441-450, 451-460, 461-470, 471-480, 481-490, 491-500, 501-600, 601-700, 701-800, 
801-900, 901-1000, 1001-2000, 2001-3000, 3001-4000, 4001-5000, and greater than 5000 bases. 
In addition, more than one set of multiplex primers can be used for one SNP including but not 
limited to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 10-20, 21-30, 31-40, 41-50, and greater than 50. 

15 In addition, 91 sets of forward and reverse primers were used to amplify other regions of 

chromosome 21, for a total of 100 sets of primers (200 primers in the reaction). These 91 primer 
sets were used to demonstrate that a large number of primers can be used in a single reaction 
without producing a large number of non-specific bands. Any number of primers can be used in the 
reaction including but not limited to 1-10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 

20 91-100, 101-200, 201-300, 301-400, 401-500, 501-600, 601-700, 701-800, 801-900, 901-1000, 
1001-2000, 2001-3000, 3001-4000, 4001-5000, 5001-6000, 6001-7000, 7001-8000, 8001-9000, 
9001-10,000, 10,001-20,000, 20,001-30,000 and greater than 30,000. 

The multiplex primers were designed to have the same nucleotides at the 3' end of the 
primer. In this case, the multiplex primers ended in "AA," wherein A indicates adenine. The 

25 primers were designed in this manner to minimize primer-dimer formation. However, the primers 
can terminate in any nucleotides including but not limited to adenine, guanine, cytosine, thymidine, 
any combination of adenine and guanine, any combination of adenine and cytosine, any 
combination of adenine and thymidine, any combination of guanine and cytosine, any combination 
of guanine and thymidine, or any combination of cytosine and thymidine. In addition the multiplex 

30 primers can have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 of the same nucleotides at the 3' end. 

The multiplex primers for SNP TSC0397235 were: 
Forward Primer: 

35 
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5' CAAGTGTCCTAA 3' 

Reverse primer: 

5' CAGCTGCTAGAA 3' 

The multiplex primers for SNP TSC0470003 were: 

Forward Primer: 

5 5 GGTTGAGGGCAA 3 ' 

Reverse primer: 

5' CACAGCGGGTAA 3' 

The multiplex primers for SNP TSC 1649726 were: 

Forward Primer: 

5' TTGACTTTTTAA 3' 

Reverse primer: 

5' ACAGAATGGGAA 3' 

The multiplex primers for SNP TSC1261039 were: 

Forward Primer: 

5' TGCAGGTCACAA 3' 

Reverse primer: 

5' TTCTTCTTATAA 3' 
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The multiplex primers for SNP TSC03 1 0507 were: 

Forward Primer: 

5 s AGGACAACCTAA 3' 

Reverse primer: 

5' TGGTGTTCAGAA 3' 

The multiplex primers for SNP TSC1650432 were: 

Forward Primer: 

5' TCAGCATATGAA 3 5 

Reverse primer: 

5'GTTGCCACACAA3' 

The multiplex primers for SNP TSC1335008 were: 

Forward Primer: 

5'CCCAGCTAGCAA3' 

Reverse primer: 

5' GGGTCACTGTAA 3 9 

The multiplex primers for SNP TSC0128307 were: 

Forward Primer: 
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5* TTAAATACCCAA 3' 

Reverse primer: 

5' TTAGGAGGTTAA 3' 

The multiplex primers for SNP TSC0259757 were: 

Forward Primer: 

5' ACACAGAATCAA 3' 

Reverse primer: 

5 9 CGCTGAGGTCAA 3' 

Ninety-one (91) additional sets of primers, which annealed to various regions along 
chromosome 21, were included in the reaction: 

Setl: 

Forward Primer: 

5' AAGTAGAGTCAA 3* 

Reverse primer: 

5' CTTCCCATQGAA 3' 

Set 2: 

Forward Primer: 

5 ' TTGGTTATTAAA 3 ' 

Reverse primer: 
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5 ' CAACTTACTGAA 3 ' 
Set 3: 

Forward Primer: 

5' CACTAAGTGAAA 3' 

Reverse primer: 

5'CTCACCTGCCAA3' 

Set 4: 

Forward Primer: 

5' ATGCATATATAA 3 ' 

Reverse primer: 

5' AGAGATCAGCAA 3' 

Set 5: 

Forward Primer: 

5' TATATTTTTCAA 3' 

Reverse primer: 

5' CAGAAAGCAGAA 3' 

Set 6: 

Forward Primer: 

5' GTATTGGGTTAA 3' 

Reverse primer: 
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5' CTGACCCAGGAA 3' 
Set 7: 

Forward Primer: 

5' CAGTTTTCCCAA 3' 

Reverse primer: 

5' AGGGCACAGGAA 3' 
Set 8: 

Forward Primer: 

5' GTATCAGAGGAA 3' 

Reverse primer: 

5' GCATGAAAAGAA 3' 

Set 9: 

Forward Primer: 

5' GATTTGACAGAA 3' 

Reverse primer: 

5' TACAGTTTACAA 3 ' 

Set 10: 

t 

Forward Primer: 

5* TGTGATTTTTAA 3* 

Reverse primer: 



213 



WO 2004/078994 

5' TTATGTTCTCAA 3' 
Set 11: 

Forward Primer: 

5' CAAGTACTTGAA 3' 

Reverse primer: 

5' CTTGTGTGGCAA 3' 

Set 12: 

Forward Primer: 

5' AGACTTCTGCAA 3' 

Reverse primer: 

5' GTTGTCTTTCAA 3' 

Set 13: 

Forward Primer: 

5' GGGACACTCCAA 3' 

Reverse primer: 

5' ATTATTATTCAA 3' 

Set 14: 

Forward Primer: 

5' ACATGATGACAA 3' 
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Reverse primer: 

5' TCAATTATAGAA 3' 

Set 15: 

Forward Primer: 

5' CTATGGGCTGAA 3' 

Reverse primer: 

5 , TGTGTGCCTGAA 3' 

Set 16: 

Forward Primer: 

5' CCATTTGTTGAA 3' 

Reverse primer: 

5' TCTCCATCAAAA 3' 

Set 17: 

Forward Primer: 

5' AATGCTGACAAA 3' 

Reverse primer: 

5' TTTCATGTCCAA 3' 

Set 18: 

Forward Primer: 

5' GGCCTCTTGGAA 3' 
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Reverse primer 

5' TCATTTTTTGAA 3' 

Set 19: 

Forward Primer: 

5' GGACTACCATAA 3' 

Reverse primer: 

5' AGTCACTCAGAA 3* 

Set 20: 

Forward Primer: 

5* CCTTGGCAGGAA 3 9 

Reverse primer: 

5' TTTCTGGTAGAA 3' 

Set 21: 

Forward Primer: 
5'CCCCCCCCCGAA3' 
Reverse primer: 
5'GCCCAGGCAGAA3 > 
Set 22: 

Forward Primer: 
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5' GAATGCGAAGAA 3' 
Reverse primer: 
5' TTAGGTAGAGAA 3' 
Set 23: 

Forward Primer: 

5' TGCTTTGGTCAA 3' 

Reverse primer: 

5* GCCCATTAATAA 3* 

Set 24: 

Forward Primer: 

5 s TGAGATCTTTAA 3' 

Reverse primer: 

5' CAGTTTGTTCAA 3' 

Set 25: 

Forward Primer: 

5' GCTGGGCAAGAA 3* 

Reverse primer: 

5' AGTCAAAGTCAA 3' 

Set 26: 

Forward Primer: 
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5' TCTCTGCAGTAA 3' 
Reverse primer: 
5 ' TGAATAACTTAA 3 ' 
Set 27: 

Forward Primer: 

5' CGGTTAGAAAAA 3' 

Reverse primer: 

S'CATCCCTTTCAAS' 
Set 28: 

Forward Primer: 

5' TCTCTTTCTGAA 3' 

Reverse primer: 

5 ' CTC AGATTGTAA 3 ' 

Set 29: 

Forward Primer: 

5' TTTGCACCAGAA 3' 

Reverse primer: 

5' GGTTAACATGAA 3' 

Set 30: 
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Forward Primer: 

5' ATTATCAACTAA 3' 

Reverse primer: 

5' GCCATTTTGTAA 3' 

Set31: 

Forward Primer: 

5' GATCTAGATGAA 3' 

Reverse primer: 

5' TTAATGTATTAA 3' 

Set 32: 

Forward Primer: 

5' CTAGGGAGACAA 3' 

Reverse primer: 

5' TGGAGGAGACAA 3' 

Set 33: 

Forward Primer: 

5' CATCACATTTAA 3 ' 

Reverse primer: 

5' GGGGTCCTGCAA 3' 
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Set 34; 

Forward Primer: 

5' CAGTTGTGCTAA 3' 

Reverse primer: 

5' TCTGCAGCCTAA 3' 

Set 35: 

Forward Primer: 

5' GAGTCATTTAAA 3' 

Reverse primer: 

5' TCTATGGATTAA 3' 

Set 36: 

Forward Primer: 

5' CAAAAAGTAGAA 3' 

Reverse primer: 

5' AATATACTCCAA 3' 

Set 37: 

Forward Primer: 

5'CGTCCAGCACAA3' 

Reverse primer: 

5' GGATGGTGAGAA3* 
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Set 38: 

Forward Primer: 

5' TCTCCTTTGTAA 3' 

Reverse primer: 

5' TCGTTATTTCAA 3' 

Set 39: 

Forward Primer: 

5' GATTTTATAGAA 3' 

Reverse primer: 

5' AGACATAAGCAA 3' 

Set 40: 

Forward Primer: 

S'TTCACCTCACAAS' 

Reverse primer: 

5' GGATTGCTTGAA 3' 

Set 41: 

Forward Primer: 

5' ACTGCATGTGAA 3' 

Reverse primer: 
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5' TTTATCACAGAA 3' 
Set 42: 

Forward Primer 

5' TCAGTAACACAA 3' 

Reverse primer: 

5' TACATCTTTGAA 3' 

Set 43: 

Forward Primer: 

5' TTGTTTCAGTAA 3' 

Reverse primer: 

5' TATGAGCATCAA 3' 

Set 44: 

Forward Primer: 

5' CTCAGCAGGCAA 3' 

Reverse primer: 

5' ACCCCTGTATAA 3' 

Set 45: 

Forward Primer: 

5' TCTGCTCAGCAA 3' 

Reverse primer: 
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5' GTTCTTTTTTAA 3 ' 
Set 46: 

Forward Primer: 

5' GTGATAATCCAA 3' 

Reverse primer 

5' GAGCCCTCAGAA 3' 

Set 47: 

Forward Primer 

5' TTTATTGGTTAA 3' 

Reverse primer: 

5' GGTACTGGGCAA 3 ' 

Set 48: 

Forward Primer: 

5' AGTGTTTTTCAA 3' 

Reverse primer: 

5' TGTTATTGGTAA 3' 

Set 49: 

Forward Primer: 

5' GCGCATTCACAA 3' 
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Reverse primer: 

5' AAACAAAAGCAA 3' 

Set 50: 

Forward Primer: 

5' TATATGATAGAA 3' 

Reverse primer: 

5'TCCCAGTTCCAA3' 

Set 51: 

Forward Primer: 

5' AAAGCCCATAAA 3' 

Reverse primer: 

5' TGTCATCCACAA 3' 

Set 52: 

Forward Primer: 

5' TTGTGAATGCAA 3' 

Reverse primer: 

5' GTATTCATACAA 3' 

Set 53: 

Forward Primer: 

5' TGACATAGGGAA 3' 
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Reverse primer: 

5' AGCAAATTGCAA 3' 

Set 54: 

Forward Primer: 

5' AGTAGATGTTAA 3' 

Reverse primer: 

5' AAAAGATAATAA 3' 

Set 55: 

Forward Primer: 

5' ACCTCATGGGAA 3' 

Reverse primer: 

5' TGGTCGACCTAA 3' 

Set 56: 

Forward Primer: 

5' TTTGCATGGTAA 3' 

Reverse primer: 

5' GCGGCTGCCGAA 3' 

Set 57: 

Forward Primer: 
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5' TCAGGAGTCTAA 3' 
Reverse primer: 
5' GCCTACCAGGAA 3' 
Set 58: 

Forward Primer: 

5' ATCTTCTGTTAA 3' 

Reverse primer: 

5' AGGTAAGGACAA 3' 

Set 59: 

Forward Primer: 

5' TGCTTTGAGGAA 3' 

Reverse primer: 

5 ' AAC AGTTTTAAA 3 ' 

Set 60: 

Forward Primer: 

5 ' TTAAATGTTTAA 3 ' 

Reverse primer: 

5' ATAGAAAATCAA 3* 

Set 61: 

Forward Primer: 
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5' GTGTTGTGTTAA 3' 
Reverse primer: 
5' GAGGACCTCGAA 3' 
Set 62: 

Forward Primer: 

5' AGAGGCTGAGAA 3' 

Reverse primer: 

5' GGTATTTATTAA 3' 

Set 63: 

Forward Primer: 

5' ATTTATCTGGAA 3' . 

Reverse primer: 

5' AGTGCAAACTAA 3' 

Set 64: 

Forward Primer: 

5' TGAACACCTTAA 3' 

Reverse primer: 

5' AATTTTTTCTAA 3' 

Set 65: 
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Forward Primer: 

5 5 TTACTATTATAA 3* 

Reverse primer: 

5' TGCTATAGTGAA 3' 

Set 66: 

Forward Primer: 

5' TGGACTATGGAA 3' 

Reverse primer: 

5'CTGCAGTCCGAA3> 

Set 67: 

Forward Primer: 

5'GCTACTGCCCAA3> 

Reverse primer: 

5' TCACATGGTGAA 3' 

Set 68: 

Forward Primer: 

5' GTGGCTCTGGAA 3* 

Reverse primer: 

5* GAATTCCATTAA 3' 
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Set 69: 

Forward Primer: 

5' TGGGGTGTCCAA 3' 

Reverse primer: 

5' GCAAGCTCCGAA 3' 

Set 70: 

Forward Primer: 

5' ATGTTTTTTCAA 3' 

Reverse primer: 

5' AGATCTGTTGAA 3' 

Set 71: 

Forward Primer: 

5' AAGTGCTGTGAA 3' 

Reverse primer: 

5' ACTTTTTTGGAA 3' 

Set 72: 

Forward Primer: 

5' AATCGGCAGGAA 3* 

Reverse primer: 

5' GGCATGTCACAA 3' 
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Set 73: 

Forward Primer: 

5' AGGAAGAAAGAA 3' 

Reverse primer: 

5' CAGTTTCACCAA 3' 

Set 74: 

Forward Primer: 

5' CACAGAATTTAA 3' 

Reverse primer: 

5' AAGAATAAGTAA 3' 

Set 75: 

Forward Primer: 

5' GGGATAGTACAA 3' 

Reverse primer: 

5' TTCCCATGATAA 3' 

Set 76: 

Forward Primer: 

5' TGATTAGTTGAA 3' 

Reverse primer: 
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5' GCATTCAGTGAA 3' 
Set 77: 

Forward Primer: 

5' AGGGAATATTAA 3' 

Reverse primer: 

5' GACCTTAGGTAA 3' 

Set 78: 

Forward Primer: 

5* TTCTTTTCACAA 3' 

Reverse primer: 

5 9 CCAAACTAAGAA 3' 

Set 79: 

Forv/ard Primer: 

5 9 GTGCTCTTAGAA 3' 

Reverse primer; 

5 9 ATGAGTTTAGAA 3' 

Set 80: 

Forward Primer: 

5 9 ATGAGCATAGAA 3 9 

Reverse primer: 
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5' GACAAATGAGAA 3' 
Set 81: 

Forward Primer: 

5' AAACCCAGAGAA 3' 

Reverse primer: 

5' CCTCACACAGAA 3' 

Set 82: 

Forward Primer: 

5' CACACTGTGGAA 3' 

Reverse primer: 

5'CACTGTACCCAA3' 

Set 83: 

Forward Primer: 

5' GTAGTATTTCAA 3' 

Reverse primer: 

5' TGGATACACTAA 3' 

Set 84: 

Forward Primer: 

5' CCCATGATTCAA 3' 



232 



WO 2004/078994 

Reverse primer: 

5' TCATAGGAGGAA 3* 

Set 85: 

Forward Primer: 

5' AGGAAAGAGAAA 3 9 

Reverse primer: 

5' ATATGGTGATAA 3' 

Set 86: 

Forward Primer: 

5'GATGCCATCCAA3' 

Reverse primer: 

5* ATACTATTTCAA 3' 

Set 87: 

Forward Primer: 

5* GTGTGCATGGAA 3' 

Reverse primer: 

5' AGGTGTTGAGAA 3* 

Set 88: 

Forward Primer: 
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5' CAGCCTGGGCAA 3' 
Reverse primer: 
5' GGAGCTCTACAA 3' 
Set 89: 

Forward Primer: 

5' AACTAAGGTTAA 3' 

Reverse primer: 

5' AACTTATGTTAA 3' 

Set 90: 

Forward Primer: 

5' ATCTCAACAGAA 3' 

Reverse primer: 

5' TAACAATGTGAA 3' 

Set 91: 

Forward Primer: 

5' AAGGATCAGGAA 3' 

Reverse primer: 
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5' CTCAAGTCTTAA 3' 



Multiplex PGR 

Regions on chromosome 21 surrounding SNPs TSC0397235, TSC0470003, TSC 1649726, 
TSC1261039, TSC0310507, TSC1650432, TSC1335008, TSC0128307, and TSC0259757 were 
amplified from the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent 
Nos. 4,683,195 and 4,683,202, incorporated herein by reference). This PCR reaction used primers 
that annealed approximately 130 bases upstream and downstream of the loci of interest It was used 
to increases the number of copies of the loci of interest to eliminate any errors that may result from 
a low number of genomes. 

For increased specificity, a "hot-start" PCR reaction was used. PCR reactions were 
performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). 
The amount of template DNA and primer per reaction can be optimized for each locus of interest. 
In this example, 15 ng of template human genomic DNA and 5 ^M of each primer were used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM were 
pooled into a single microcentrifuge tube and mixed. Eight microliters of the primer mix was used 
in a total PCR reaction volume of 40 fxl (1.5 yl of template DNA, 10.5 ^1 of sterile water, 8 |xl of 
primer mix, and 20^1 of HotStarTaq). Twenty-five cycles of PCR were performed. The following 
PCR conditions were used: 

(1) 95°C for 15 minutes; 

(2) 95°Cfor30 seconds; 

(3) 4°C for 30 seconds; 

(4) 37°Cfor30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. 

In another embodiment, the loci of interest are amplified using 6-base oligonucleotides, 7- 
base oligonucleotides, 8-base oligonucleotides, 9-base oligonucleotides, 10-base oligonucleotides, 
11-base oligonucleotides, 12-base oligonucleotides, 13- base oligonucleotides, 14-base 
oligonucleotides, or greater than 14-base oligonucleotides. In a preferred embodiment, 6-base 
oligonucleotides, 7-base oligonucleotides, 8-base oligonucleotides, 9-base oligonucleotides, 10- 
base oligonucleotides, 1 1-base oligonucleotides, or 12-base oligonucleotides are used to amplify the 



235 



WO 2004/078994 



PCT/US2004/006337 



loci of interest. In another embodiment, any number of oligonucleotides can be used including but 
not limited to 1-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, 40-45, 45-50, 50-100, 100-500, 
500-1000, 1000-2000, 2000-4000, 4000-8000, 8000-10,000 or greater than 10,000. With a small 
number of random oligos, the concentration of the oligos is large enough to allow efficient 
5 amplification, and yet, the number of oligos is small enough that it does not cause interference 
between the oligos. This allows efficient amplification of the genome. 

In another embodiment, the upstream and downstream sequences of the loci of interest are 
analyzed to identify a 6-base, 7-base, 8-base, 9-base, 10-base, 11 -base, or 12-base sequence that is 
present in the sequence upstream or downstream for each of the loci of interest, which is then used 

10 to amplify the loci of interest. In another embodiment, any number of 6-base oligonucleotides can 
be used to amplify the loci of interest including but not limited to 1-10, 10-50, 50-100, 100-200, 
200-500, or greater than 500. 

In another embodiment, the number of loci of interest from a small number of genomes can 
be increased by amplifying a limited number of the loci of interest, followed by removal of the 

15 primers, and amplification of the remaining loci of interest. All the loci of interest do not have to 
be multiplexed in one reaction. Any number of experimentally determined loci of interest can be 
multiplexed in a single reaction including but not limited to 1-5, 5-10, 10-25, 25-50, 50-100, 100- 
200, 200-400, or greater than 400. After increasing the number of copies of these loci of interest, 
the sample can be passed through a column that allows the amplified products to bind and the 

20 primers and unused dNTPs to be removed. After eluting the bound products from the column, 
different loci of interest can be amplified in a single reaction. This reduces the amount of 
interaction between the primers. 

Other methods of genomic amplification can also be used to increase the copy number of 
25 the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
aL, PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
aL, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et aL, Genomic Research 
1 1:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
30 Genome Amplification kits, and Tagged PCR 

Purification of Fragment of Interest 

The excess primers and nucleotides were removed from the reaction by using Qiagen 
35 MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions were performed 
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following the manufacturer's instructions supplied with the columns. The DNA was eluted in 100 
jil of sterile water. 

PCR Reaction Two 

5 

SNP TSC0397235 was amplified using the following primer set: 
First Primer: 

10 5' TTAGTCATCGCAGAATTCTACTTCTTTCTGAAGTGGGA 3' 
Second primer: 
5' GGACAGCTCGATGGGACTAATGCATACTC 3 s 

15 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 103 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

20 SNP TSC0470003 was amplified using the following primer set: 

First Primer: 

5' GTAGCCACTGGTGAATTCGTGCCATCGCAAAAGAATAA 3* 

25 

Second primer: 

5' ATTAGAATGATGGGGACCCCTGTCTTCCC 3' 

30 The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 

enzyme EcoRI, and was designed to anneal 80 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

SNP TSC 1649726 was amplified using the following primer set: 

35 
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First Primer: 

5' ACGCATAGGAAGGAATTCATTCTGACACGTGTGAGATA 3* 

Second primer: 

5 s GAAATTGACCACGGGACTGCACACTTTTC 3' 

The first primer contained a biotin tag at the 5* end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 113 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

SNP TSC1261039 was amplified using the following primer set: 
First Primer: 

¥ CGGTAAATCGGAGAATTCAAGTTGAGGCATGCATCCAT 3' 

Second primer: 

5' TCGGGGCTCAGCGGGACCACAGCCACTCC 3' 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 54 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

SNP TSC03 10507 was amplified using the following primer set: 

» 

First Primer: 

5' TCTATGCACCACGAATTCAATATGTGTTCAAGGACATT 3' 
Second primer: 
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5' TGCTTAATCGGTGGGACTTGTAATTGTAC 3' 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 93 bases from the locus of interest. The second primer 
5 contained the recognition site for the restriction enzyme BsmF I. 

SNP TSC 1 650432 was amplified using the following primer set: ' 

First Primer: 

10 

5' CGCGTTGTATGCGAATTCCCTGGGGTATAAAGATAAGA 3' 
Second primer: 

15 5' CTCACGGGAACTGGGACACCTGACCCTGC 3 ■ 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 80 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

20 

SNP TSC1335008 was amplified using the following primer set: 
First Primer: 

25 5' GTCTTGCCGCTTGAATTCCCATAGAAGAATGCGCCAAA 3 ' 
Second primer: 
5' TTGAGTAGTACAGGGACACACTAACAGAC 3' 

30 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 94 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

35 
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SNP TSC0128307 was amplified using the following primer set: 
First Primer: 

5' AATACTGTAGGTGAATTCTTGCCTAAGCATTTTCCCAG 3' 

Second primer: 

5' GTGTTGACATTCGGGACTGTAATCTTGAC 3' 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 54 bases from the locus of interest. The second primer 
contained the recognition site for the restriction enzyme BsmF L 

SNP TSC0259757 was amplified using the following primer set: . 

First Primer: 

5' TCTGTAGATTCGGAATTCTTTAGAGCCTGTGCGCTGAG 3' 

Second primer: 
5 5 CGTACCAGTACAGGGACGCAAACTGAGAC 3 s 

The first primer contained a biotin tag at the 5' end and a recognition site for the restriction 
enzyme EcoRI, and was designed to anneal 100 bases from the locus of interest The second primer 
contained the recognition site for the restriction enzyme BsmF I. 

All loci of interest were amplified from the template genomic DNA using the polymerase 
chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
In this example, the loci of interest were amplified in separate reaction tubes but they can also be 
amplified together in a single PCR reaction. For increased specificity, a "hot-start" PCR was used. 
PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog 
number 203443). 

One microliter of the elutate from the multiplex reaction (PCR product eluted from the 
MinElute column) was used as template DNA for each PCR reaction. Each SNP was amplified in 
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triplicate when the multiplex sample was used as the template. As a control, each SNP was 
amplified from 15 ng of the original template DNA (DNA that did not undergo the multiplex 
reaction). The amount of template DNA and primer per reaction can be optimized for each locus of 
interest but in this example, 5 \iM of each primer was used. Forty cycles of PCR were performed. 
The following PCR conditions were used: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 37°Cfor30 seconds; 

(3) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°C for 30 seconds; 

(6) 64°C for 30 seconds; 

(7) 95°Cfor30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature was about the melting temperature of 
the 3' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3 1 region, which anneals to the 
template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
temperature from TM1 to TM2 to TM3 in the fust three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. 

Agarose Gel Analysis 

Four microliters of a twenty microliter PCR reaction for each SNP from the original 
template DNA was analyzed by agarose gel electrophoresis (see FIG. 18 A). Four microliters of a 
twenty microliter PCR reaction for each SNP that was amplified from the multiplexed template was 
analyzed on by agarose gel electrophoresis (see FIG. 1 8B). 

As seen in FIG. 18A, for 8/9 of the SNPs amplified from the original template DNA, a 
single band of high intensity was seen (lanes 1-3, and 5-9). The band migrated at the correct 
position for each of the 8 SNPs. Amplification of TSC1261039 from the original template DNA 
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produced a band of high intensity, which migrated at the correct position, and a faint band of lower 
molecular weight (lane 4). Only two bands were seen, and the bands could clearly be distinguished 
based on molecular weight. The PCR method described herein allows clean amplification of the 
loci of interest from genomic DNA without any concentration or enrichment of the loci of interest. 

As seen in FIG. 18B, the primers used to amplify SNPs TSC0397235, TSC0470003, 
TSC03 10507, and TSC0128307 from the multiplexed template DNA produced a single band of 
high intensity, which migrated at the correct position (lanes 1, 2, 5, and 8). No additional bands 
were introduced despite the fact that the multiplex reaction contained two hundred primers. While 
the multiplex primers were 12 bases in length and likely annealed to additional sequences other than 
those located on chromosome 21, the products were not seen because the bands were not amplified 
in the second PCR reaction. The second PCR reaction employed primers specific for the loci of 
interest and used asymmetric oligonucleotides and escalating annealing temperatures, which allows 
specific amplification from the genome (see Example 1). 

Amplification of TSC 1649726 from the multiplex template DNA produced one band of 
high intensity and two weaker bands, which could clearly be distinguished based on molecular 
weight (see FIG. 18B, lane 3). Amplification of TSC1261039 from the multiplex template DNA 
produced a high intensity band of the correct molecular weight and a faint band of lower molecular 
weight (see FIG. 18B, lane 4). The low molecular weight band was the same size as the band seen 
from the amplification of TSC1261039 from the original template DNA (compare FIG. 18A, lane 4 
with FIG. 18B, lane 4). Thus, amplification of TSC1261039 on the multiplex template DNA did 
not introduce any additional non-specific bands 

Amplification of SNPs TSC1650432, TSC1335008, and TSC0259757 from the multiplex 
template DNA produced one band of high intensity, which migrated at the correct position, and one 
weaker band (lanes 6, 7, and 9). For SNPs TSC1650432 and TSC0259757, the weaker band was of 
lower molecular weight, and clearly was distinguishable from the band of interest (see FIG. 18B, 
lanes 6 and 9). For SNP TSC1335008, the weaker band was of slightly higher molecular weight. 
However, the correct band can be identified by comparing to the amplification products of 
TSC1335008 from the original template DNA, (compare FIG. 18A, lane 7 and FIG. 18B, lane 7). 
The PCR conditions can also be optimized for TSC1335008. All 9 SNPs were amplified under the 
exact same conditions, which produced clearly distinguishable bands for the amplified SNPs. 

Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. One half of the PCR 
reaction was transferred to a well of a Streptawell, transparent, High-Bind plate from Roche 
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Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 
Biochemicals Catalog). The first primers contained a 5' biotin tag so the PCR products bound to the 
Streptavidin coated wells while the genomic template DNA did not. The streptavidin binding 
reaction was performed using a Thennomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each 
well was aspirated to remove unbound material, and washed three times with IX PBS, with gentle 
mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., Biotechniques 
10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
After digestion, the wells were washed three times with PBS to remove the cleaved fragments. 

Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5' overhang, 
which contained the SNP site or locus of interest and a 3' recessed end. The 5' overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

As discussed in detail in Example 6, the sequence of both alleles of a SNP can be 
determined by using one labeled nucleotide in the presence of the other unlabeled nucleotides. The 
following components were added to each fill in reaction: 1 fil of fluorescently labeled ddGTP, 0.5 
\il of unlabeled ddNTPs ( 40 \M), which contained all nucleotides except guanine, 2 \il of 10X 
sequenase buffer, 0.25 fxl of Sequenase, and water as needed for a 20(il reaction. The fill in 
reaction was performed at 40°C for 10 min. Non-fluorescently labeled ddNTP was purchased from 
Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained from Amersham 
(Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS ; (100 |il) three times. The "filled 
in" DNA fragments then were released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 
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Detection of the Locus of Interest 



The samples were loaded into a lane of a 36 cm 5% acrylamide (urea) gel (BioWhittaker 
Molecular Applications, Long Ranger Run Gel Packs, catalog number 50691). The samples were 
electrophoresed into the gel at 3000 volts for 3 min. The gel was run for 3 hours on a sequencing 
apparatus (Hoefer SQ3 Sequencer). The gel was removed from the apparatus and scanned on the 
Typhoon 9400 Variable Mode Imager. The incorporated labeled nucleotide was detected by 
fluorescence. A box was drawn around each band and the intensity of the band was calculated 
using the ImageQuant software. 

Below, a schematic of the 5' overhang for TSC0470003 after digestion with BsmF I is 
depicted: 



5' CTCT 

3' GAGA R A C C 
Overhang position 12 3 4 



The observed nucleotides for TSC0470003 are adenine and guanine on the sense strand 
(herein depicted as the top strand). The third position of the overhang corresponds to cytosine, 
which is complementary to guanine. Labeled ddGTP was used in the presence of unlabeled dATP, 
dCTP, and dTTP. Schematics of the DNA molecules after the fill-in reaction are depicted below: 



Allele 1 5' CTCT 

y GAGA 
Overhang position 

Allele 2 5' CTCT 

3' GAGA 
Overhang position 



G* 

C A C C 

12 3 4 

A T G* 

T A C C 

12 3 4 



Two bands were seen; the lower molecular weight band corresponded to the DNA molecules filled 
in with ddGTP at position 1 complementary to the overhang and the higher molecular weight band 
corresponded to the DNA molecules filled in with ddGTP at position 3 complementary to the 
overhang (see FIG. 19). 

The percentage of allele 2 to allele 1 at TSC0470003 after amplification from the original 
template DNA and the multiplexed template DNA was calculated. The use of one fluorescently 
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labeled nucleotide to detect both alleles in a single reaction reduces the amount of error that is . 
introduced through pipetting reactions, and the error that is introduced through the quantum 
coefficients of different dyes. 

For SNP TSC047003, the percentage of allele 2 to allele 1 was calculated by dividing the 
5 value of allele 2 by the sum of the values for allele 2 and allele 1. The percentage of allele 2 to 
allele 1 for TSC047003 on the original template DNA was calculated to be 0.539 (see Table XIX). 
Three PCR reactions were performed for each SNP on the multiplexed template DNA. The average 
percentage of allele 2 to allele 1 for TSC047003 on the multiplexed DNA was 0.49 with a standard 
deviation of 0.0319 (see Table XK). There was no statistically significant difference between the 

1 0 percentage obtained on the original template DNA and the multiplexed template DNA. 

For SNP TSC1261039, the percentage of allele 2 to allele 1 for TSC1261039 on the 
original template DNA was calculated to be 0.44 (see Table XK). Three PCR reactions were 
performed for each SNP on the multiplexed template DNA (see FIG. 19B). The average percentage 
of allele 2 to allele 1 for TSC1261039 on the multiplexed DNA was 0.468 with a standard deviation 

15 of 0.05683 (see Table XK). There was no statistically significant difference between the 
percentages of allele 2 to allele 1 obtained on the original template DNA and the multiplexed 
template DNA. 

The variation seen in the percentage of allele 2 to allele 1 for TSC1261039 on the 
multiplexed template DNA was likely due to pipetting reactions. The variation can be reduced by 
20 increasing the number of replicates. With a large number of replicates, a percentage can be 
obtained with minimum statistical variation. 

Likewise, there was no statistical difference between the percentage of allele 2 to allele 1 
on the original template DNA and on the multiplexed template DNA for SNPs TSC03 10507 and 
TSC1335008 (see Table XK, and FIGS. 19C and 19D). Thus, a multiplex reaction can be used to 
25 increase the number of chromosomal regions containing the loci of interest without affecting the 
percentage of one allele to the other at the variable sites. 

TABLE XK. Percentage of allele 2 to allele 1 at various SNPs with and without multiplexing. 



TSC047003 










Allele 1 


Allele 2 


2/(2+1) 


IA 


5535418 


6487873 


0.539608748 


M1 


4804358 


4886716 


0.504249168 


M2 


5549389 


5958585 


0.517778803 


M3 


8356275 


7030245 


0.45690936 
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Mean (M1-M3) 






0.49297911 


STDEV 
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794790 
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M3 
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STDEV 






0.027376412 



The methods described herein used two distinct amplification reactions to amplify the loci 
of interest. In the first PCR reaction, oligonucleotides were designed to anneal upstream and 
downstream of the loci of interest. Unlike traditional genomic amplification, these primers were 
5 not degenerate and annealed at a specified distance from the loci of interest. However, due to the 
length of the primers, it is likely that the primers annealed to other regions of the genome. These 
primers were used to increase the amount of DNA available for genetic analysis. 

The second PCR reaction employs the methods described in Examples 1-6. The primers 
are designed to amplify the loci of interest, and the sequence is determined at the loci of interest. 

10 The conditions of the second PCR reaction allowed specific amplification of the loci of interest 
from the multiplexed template DNA. If there were any non-specific products from the multiplex 
reaction, they did not impede amplification of the loci of interest. There was no statistical 
difference in the percentages of allele 2 to allele 1 at the four SNPs analyzed, regardless of whether 
the amplification was performed on original template DNA or multiplexed template DNA. 

15 The SNPs analyzed in this example were located on human chromosome 21. However, the 

methods can be applied to non-human and human DNA including but not limited to chromosomes 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, and Y. The multiplex 
methods can also be applied to analysis of genetic mutations including but not limited to nucleotide 
substitutions, insertions, deletions, and rearrangements. 

20 The above methods can be used to increase the amount of DNA available for genetic 

analysis whenever the starting template DNA is limiting in quantity. For example, pre-malignant 
and pre-invasive lesions with malignant cells usually constitute a small fraction of the cells in the 
specimen, which reduces the number of genetic analyses that can be performed. The methods 
described herein can be used to increase the amounts of malignant DNA available for genetic 

25 analysis. Also, the number of fetal genomes present in the maternal blood is often low; the methods 
described herein can be used to increase the amount of fetal DNA. 

EXAMPLE 13 

30 Plasma isolated from blood of a pregnant female contains both maternal template DNA and 

fetal template DNA. As discussed earlier, the percentage of fetal DNA in the maternal plasma 
varies for each pregnant female. However, the percentage of fetal DNA can be determined by 
analyzing SNPs wherein the maternal template DNA is homozygous and the template DNA 
obtained from the plasma displays a heterozygous pattern. 
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10 



For example, assume SNP X can either be adenine or guanine, and the maternal DNA for 
SNP X is homozygous for guanine. The labeling method described in Example 6 can be used to 
determine the sequence of the template DNA in the plasma sample. If the plasma sample contains 
fetal DNA, which is heterozygous at SNP X, the following DNA molecules are expected after 
digestion with the type IIS restriction enzyme BsmF I, and the fill-in reaction with labeled ddGTP, 
unlabeled dATP, dTTP, and dCTP. 

Maternal Allele 1 5' GGGT G* 

3'CCCA C T C A 

Maternal Allele 2 5' GGGT G* 

3'CCCA C T C A 



Fetal Allele 1 5' GGGT G* 

15 3'CCCA C T C A 

Fetal Allele 2 5' GGGT A A G* 

3'CCCA T T C A 

20 Two signals are seen; one signal corresponds to the DNA molecules filled in with ddGTP at 

position one complementary to the overhang and the second signal corresponds to the DNA 
molecules filled in with ddGTP at position three complementary to the overhang. However, the 
maternal DNA is homozygous for guanine, which corresponds to the DNA molecules filled in at 
position one complementary to the overhang. The signal from the DNA molecules filled in with 

25 ddGTP at position three complementary to the overhang corresponds to the adenine allele, which 
represents the fetal DNA. This signal becomes a beacon for the fetal 'DNA, and can used to 
measure the amount of fetal DNA present in the plasma sample. 

There is no difference in the amount of fetal DNA from one chromosome to another. For 
instance, the percentage of fetal DNA in any given individual from chromosome 1 is the same as 

30 the percentage of fetal DNA from chromosome 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, X and Y. Thus, the allele ratio calculated for SNPs on one chromosome can be 
compared to the allele ratio for the SNPs on another chromosome. 

For example, the allele ratio for the SNPs on chromosome 1 should be equal to the allele 
ratio for the SNPs on chromosomes 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 

35 21, 22, X, and Y. However, if the fetus has a chromosomal abnormality, including but not limited 
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to a trisomy or monosomy, the ratio for the chromosome that is present in an abnormal copy 
number will differ from the ratio for the other chromosomes. 

Blood from a pregnant female was collected after informed consent had been obtained./ The 
blood sample was used to demonstrate that fetal DNA can be detected in the maternal plasma by 
analyzing SNPs wherein the maternal DNA was homozygous, and the same SNP displayed a 
heterozygous pattern from DNA obtained from the plasma of a pregnant woman. 

Preparation of Plasma from Whole Blood 

Plasma was isolated from 4 tubes each containing 9 ml of blood (Fischer Scientific, 9 ml 
EDTA Vacuette tubes, catalog number NC9897284). The blood was obtained by venipuncture 
from a pregnant female who had given informed consent. After collecting the blood, formaldehyde 
(25 nl/ml of blood) was added to each of the tubes. The tubes were placed at 4°C until shipment. 
The tubes were shipped via Federal Express in a foam container containing an ice pack. 

The blood was centrifuged at 1000 rpm for 10 minutes. The brake on the centrifuge was 
not used. This centrifugation step was repeated. The supernatant was transferred to a new tube and 
spun at 3,000 rpm for ten minutes. The brake on the centrifuge was not used. The supernatant from 
each of the four tubes was pooled and aliquoted into two tubes. The plasma was stored at -80°C 
until the DNA was purified. 

Template DNA was isolated using the QIAmp DNA Blood Midi Kit supplied by QIAGEN 

(Catalog number 51183). The template DNA was isolated as per instructions included in the kit. 

» 

The template DNA from the plasma was eluted in a final volume of 20 microliters. 
Isolation of Maternal DNA 

After the plasma was removed from the sample described above, one milliliter of the 
remaining blood sample, which is commonly referred to as the "buffy-coat," was transferred to a 
new tube. One milliliter of IX PBS was added to the sample. Template DNA was isolated using 
the QIAmp DNA Blood Midi Kit supplied by QIAGEN (Catalog number 5 1 1 83). 

Identification of Homozygous Maternal SNPs 

Example 8 describes a method for identifying SNPs that are highly variable within the 
population or for identifying heterozygous SNPs for a given individual. The methods as described 
in Example 8 were applied to the maternal template DNA to identify SNPs on chromosome 13 
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wherein the maternal DNA was homozygous. Any number of SNPs can be screened. The number 
of SNPs to be screened is proportional to the number of heterozygous SNPs in the fetal DNA that 
need to be analyzed. 

As described in detail in Example 6, one labeled nucleotide can be used to determine the 
sequence of both alleles at a particular SNP. SNPs for which the sequence can be determined with 
labeled ddGTP in the presence of unlabeled dATP, dTTP, and dCTP were chosen for this example. 
However, SNPs for which the sequence can be determined with labeled ddATP, ddCTP or ddTTP 
can also be used. Additionally, the SNPs to be analyzed can be chosen such that all are labeled with 
the same nucleotide or any combination of the four nucleotides. For instance, if 400 SNPs are to be 
screened, 100 can be chosen such that the sequence is determined with labeled ddATP, 100 can be 
chosen such that the sequence is determined with labeled ddTTP, 100 can be chosen such that the 
sequence is determined with labeled ddGTP, and 100 can be chosen such that the sequence is 
determined with labeled ddCTP, or any combination of the four labeled nucleotides. 

Twenty-nine SNPs wherein the maternal DNA was homozygous were identified: 
TSC0052277, TSC1225391, TSC0289078, TSC1349804, TSC0870209, TSC0194938, 
TSC0820373, TSC0902859, TSC0501510, TSC1228234, TSC0082910, TSC0838335, 
TSC0818982, TSC0469204, TSC1084457, TSC0466177, TSC1270598, TSC1002017, 
TSC1 104200, TSC0501389, TSC0039960, TSC0418134, TSC0603688, TSC0129188, 
TSC1 103570, TSC0813449, TSC0701940, TSC0087962, and TSC0660274. Heterozygous 
SNPs will vaiy from individual to individual. 

Design of Multiplex Primers 

A low copy number of fetal genomes typically is present in the maternal plasma. To 
increase the copy number of the loci of interest located on chromosome 13, primers were designed 
to anneal at approximately 130 bases upstream and 130 bases downstream of each loci of interest 
This was done to reduce statistical sampling error that can occur when working with a low number 
of genomes, which can influence the ratio of one allele to another (see Example 1 1). The primers 
were 12 bases in length. However, primers of any length can be used including but not limited to 2, 
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 
32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 116-125, and greater 
than 125 bases. Primers were designed to anneal to both the sense strand and the antisense strand. 

The primers were designed to terminate at the 3' end in the dinucleotide "AA" to reduce the 
formation of primer-dimers. However, the primers can be designed to end in any of the four 
nucleotides and in any combination of the four nucleotides. 
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The multiplex primers for SNPTSC0052277 were 

Forward primer: 

5 5 GACATGTTGGAA 3' 

Reverse primer: 

5' ACTTCCAGTTAA 3* 

The multiplex primers for SNP TSC1225391 were: 

Forward primer: 

5'GTTTCCTGTTAA3' 

Reverse primer 

5' CGATGATGACAA 3' 

The multiplex primers for SNP TSC0289078 were: 

Forward primer 

5' GAGTAGAGACAA 3' 

Reverse primer 

5' TCCCGGATACAA 3' 

The multiplex primes for SNP TSC1349804 were: 

Forward primer: 
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5' CATCCTCTAGAA 3' 

Reverse primer: 

5 9 TATTCCTGAGAA 3' 

The multiplex primers for SNP TSC0870209 were: 

Forward primer: 

5 ' AGTTTGTTTTAA 3 5 

Reverse primer: 

5' TATAAACGATAA 3' 

The multiplex primers for SNP TSC0194938 were: 

Forward primer: 

5' TTTGACCGATAA 3' 

Reverse primer: 

5' TGACAGGACCAA 3' 

The multiplex primers for SNP TSC0820373 were: 

Forward primer: 

5' TTATTCATTCAA 3* 

» 

Reverse primer: 

5' AGTTTTTCACAA 3' 
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The multiplex primers for SNP TSC0902859 were: 
Forward primer: 

5 

5*CACCTCCCTGAA3' 
Reverse primer: 
10 5' CC AG ATTGAGAA 3 9 

The multiplex primers for SNP TSC0501510 were: 
Forward primer: 

15 

5' TGTGTCCACCAA 3' 
Reverse primer: 
20 5' CTTCTATTCCAA 3' 

The multiplex primers for SNP TSC 1228234 were: 
Forward primer: 
5' TCACAATAGGAA 3* 
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30 



Reverse primer: 
5 ' TACAAGTGAGAA 3 ' 
The multiplex primers for SNP TSC00829 10 were: 
3 5 Forward primer: 
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5' GAGTTTTCGTAA 3' 

Reverse primer: 

5'GTGTGCCCCCAA3' 

The multiplex primers for SNP TSC0838335 were: 

Forward primer: 

S'GCACCACTGCAAS* 

Reverse primer: 

5' GAACACAATGAA 3' 

The multiplex primers for SNP TSC08 18982 were: 

Forward primer: 

5' TATCCTATTCAA 3 s 

Reverse primer: 

5' CAACCATTATAA 3 9 

The multiplex primers for SNP TSC0469204 were: 

Forward primer: 

5' TATGCTTTACAA 3' 

Reverse primer: 

254 



WO 2004/078994 PCT/US2004/006337 
5 9 TTTGTTTACC AA 3 9 

The multiplex primers for SNP TSC1084457 were: 

Forward primer: 

5' AGGAAATTAGAA 3* 

Reverse primer: 

5 9 TGTTAGACTTAA 3' 

The multiplex primers for SNP TSC0466177 were: 

Forward primer: 

5' TATTTGGAGGAA 3' 

Reverse primer: 

5 ' GGC ATTTGTC AA 3 ' 

The multiplex primers for SNP TSC 1270598 were: 

Forward primer: 

5' ATACTCCAGGAA 3' 

Reverse primer: 

5'CAGCCTGGACAA3' 

The multiplex primers for SNP TSC1002017 were: 

Forward primer: 
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5* CCATTGCAGTAA 3' 

Reverse primer: 

5' AGGTTCTCATAA 3' 

The multiplex primers for SNP TSC1 104200 were: 

Forward primer: 

5' TGTCATCATTAA 3' 

Reverse primer: 

5' TGGTATTTGCAA 3' 

The multiplex primers for SNP TSC0501389 were: 

Forward primer: 

5' TAGGGTTTGTAA 3' 

Reverse primer: 

5' CCCTAAGTAGAA 3* 

The multiplex primers for SNP TSC0039960 were: 

Forward primer: 

5* GTATTTCTTTAA 3' 

Reverse primer: 

i 
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5' GAGTCTTCCCAA 3' 

The multiplex primers for SNP TSC0418134 were: 

Forward primer: 

5* CAGGTAGAGTAA 3* 

Reverse primer: 

5' ATAGGATGTGAA 3' 

The multiplex primers for SNP TSC0603688 were: 

Forward primer: 

5' CAATGTGTATAA 3' 

Reverse primer: 

5' AGAGGGCATCAA 3' 

The multiplex primers for SNP TSC0129188 were: 

Forward primer: 

5' CCAGTGGTCTAA 3' 

Reverse primer: 

5' TAAACAATAGAA 3' 

i 

The multiplex primers for SNP TSC1 103570 were: 
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Forward primer 

5' GCACACTTTTAA 3' 

Reverse primer: 

5' ATGGCTCTGCAA 3' 

The multiplex primers for SNP TSC0813449 were: 

Forward primer: 

5' GTCATCTTGTAA 3 9 

Reverse primer: 

5' TGCTTCATCTAA 3* 

The multiplex primers for SNP TSC0701940 were: 

Forward primer: 

5 ' AGAAAGGGGCAA 3 ' 

Reverse primer: 

5' CTTTTCTTTCAA V 

The multiplex primers for SNP TSC0087962 were: 
Forward primer: 
5 , CTACTCTCTCAA3 , 
Reverse primer: 
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5 



5' ACAGCATTATAA 3' 
The multiplex primers for SNP TSC0660274 were: 
Forward primer: 
5' ACTGCTCTGGAA 3' 
10 Reverse primer: 

5* GCAGAGGCACAA 3' 
Multiplex PCR 

15 

Regions on chromosome 13 surrounding the above-mentioned 29 SNPs were amplified 

from the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 

4,683,195 and 4,683,202, incorporated herein by reference). This PCR reaction used primers that 

annealed approximately 150 bases upstream and downstream of each loci of interest. The fifty- 
20 eight primers were mixed together and used in a single reaction to amplify the template DNA. This 

reaction was done to increase the number of copies of the loci of interest, which eliminates error 

generated from a low number of genomes. 

For increased specificity, a "hot-start" PCR reaction was used. PCR reactions were 

performed using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). 
25 The amount of template DNA and primer per reaction can be optimized for each locus of interest. 

In this example, the 20 jxl of plasma template DNA was used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM were 

pooled into a single microcentrifuge tube and mixed. Four microliters of the primer mix was used 

in a total PCR reaction volume of 50 ^1 (20jli1 of template plasma DNA, 1 \i\ of sterile water, 4 |xl of 
30 primer mix, and 25 nl of HotStar Taq. Twenty-five cycles of PCR were performed. The following 

PCR conditions were used: 

(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 

(3) 4°C for 30 seconds; 
35 (4) 37°C for 30 seconds; 
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(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. 
5 Other methods of genomic amplification can also be used to increase the copy number of 

the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
al, PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
al, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et al, Genomic Research 
10 11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

Purification of Fragment of Interest 

15 The unused primers, and nucleotides were removed from the reaction by using Qiagen 

MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions were performed 
following the manufacturer's instructions supplied with the columns. The DNA was eluted in 100 
|xl of sterile water. 

20 PCR Reaction Two 

Design of Primers 

SNPTSC0052277 was amplified using the following primer set: 

25 

First primer: 

5' CTCCGTGGTATGGAATTCCACTCAAATCTTCATTCAGA 3 ' 
30 Second primer: 

5' ACGTCGGGTTACGGGACACCTGATTCCTC 3' 

SNP TSC1225391 was amplified using the following primer set: 

35 
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First primer: 

5' TACCATTGGTTTGAATTCTTGTTTCCTGTTAACCATGC 3' 

Second primer: 
5' GCCGAGTTCTACGGGACAGAAAAGGGAGC 3 ' 

SNP TSC0289078 was amplified using the following primer set: 

First primer: 

5' TGCAGTGATTTCGAATTCGAGACAATGCTGCCCAGTCA 3 1 

Second primer: 
5' TCTAAATTCTCTGGGACCATTCCTTCAAC 3* 

SNP TSC1349804 was amplified using the following primer set: 

First primer: 

5' ACTAACAGCACTGAATTCCATGCTCTTGGACTTTCCAT 3' 

Second primer: 
5' TCCCCTAACGTTGGGACACAGAATACTAC 3' 

SNP TSC0870209 was amplified using the following primer set: 

First primer: 

5' GTCGACGATGGCGAATTCCTGCCACTCATTCAGTTAGC 3' 
Second primer: 
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5' GAACGGCCCACAGGGACCTGGCATAACTC 3' 

SNP TSC0194938 was amplified using the following primer set: 
First primer: 

5' TCATGGTAGCAGGAATTCTGCTTTGACCGATAAGGAGA 3' 

Second primer: 
5' ACTGTGGGATTCGGGACTGTCTACTACCC 3' 

SNP TSC0820373 was amplified using the following primer set: 

First primer: 

» 

5' ACCTCTCGGCCGGAATTCGGAAAAGTGTACAGATCATT 3' 

Second primer: 
5' GCCGGATACGAAGGGACGGCTCGTGACTC 3' 

SNP TSC0902859 was amplified using the following primer set: 

First primer: 

5' CCGTAGACTAAAGAATTCCCTGATGTCAGGCTGTCACC 3' 

Second primer: 
5' ATCGGATCAGTCGGGACGGTGTCTTTGCC 3' 

t 

SNP TSC0501510 was amplified using the following primer set: 
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First primer: 

5' GCATAGGCGGGAGAATTCCCTGTGTCCACCAAAGTCGG 3' 

Second primer: 
5' CCCACATAGGGCGGGACAAAGAGCTGAAC 3' 

SNP TSC 1228234 was amplified using the following primer set: 

First primer: 

5' GGCTTGCCGAGCGAATTCTAGGAAAGATACGGAATCAA 3' 

Second primer: 
5' TAACCCTCATACGGGACTTTCATGGAAGC 3' 

SNP TSC0082910 was amplified using the following primer set: 

First primer: 

5' ATGAGCACCCGGGAATTCTGATTGGAGTCTAGGCCAAA 3' 

Second primer: 
5' TGCTCACCTTCTGGGACGTGGCTGGTCTC V 

SNP TSC0838335 was amplified using the following primer set: 

First primer: 

5' ACCGTCTGCCACGAATTCTGGAAAACATGCAGTCTGGT 3' 
Second primer: 
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5' TACACGGGAGGCGGGACAGGGTGATTAAC 3 9 

SNP TSC0818982 was amplified using the following primer set: 
First primer: 

5' CTTAAAGCTAACGAATTCAGAGCTGTATGAAGATGCTT 3' 

Second primer: 
5' AACGCTAAAGGGGGGACAACATAATTGGC 3' 

SNP TSC0469204 was amplified using the following primer set: 

First primer: 

5' TTGTAAGAACGAGAATTCTGCAACCTGTCTTTATTGAA 3' 

Second primer: 
5' CTTCACCACTTTGGGACACTGAAGCCAAC 3' 

SNP TSC1084457 was amplified using the following primer set: 

First primer: 

5' AACCATTGATTTGAATTCGAAATGTCCACCAAAGTTCA 3' 

Second primer: 
5' TGTCTAGTTCCAGGGACGCTGTTACTTAC 3' 

SNP TSC0466177 was amplified using the following primer set: 
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First primer: 

5' CGAAGGTAATGTGAATTCTGCCACAATTAAGACTTGGA 3' 
5 Second primer: 

5' ATACCGGTTTTCGGGACAGATCCATTGAC 3' 

SNP TSC1270598 was amplified using the following primer set: 

10 

First primer: 

5' CCTGAAATCCACGAATTCCACCCTGGCCTCCCAGTGCA 3 9 
15 Second primer: 

5' TAGATGGTAGGTGGGACAGGACTGGCTTC 3' 

SNP TSC1002017 was amplified using the following primer set: 

20 

First primer: 

5' GCATATCTTAGCGAATTCCTGTGACTAATACAGAGTGC 3' 
25 Second primer: 

5 s CCAAATATGGTAGGGACGTGTGAACACTC 3' 

SNP TSC1 1 04200 was amplified using the following primer set: 

30 

First primer: 

5' TGCCGCTACAGGGAATTCATATGGCAGATATTCCTGAA 3' 
35 Second primer: 
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5' ACGTTGCGGACCGGGACTTCCACAGAGCC 3' 

SNP TSC0501389 was amplified using the following primer set: 
First primer: 

5' CTTCGCCCAATGGAATTCGGTACAGGGGTATGCCTTAT 3* 
Second primer: 

5' TGCACTTCTGCGGGGACCAGAGGAGAAAC 3' 

SNP TSC0039960 was amplified using the following primer set: 
First primer: 

5' TGTGGGTATTCTGAATTCCACAAAATGGACTAACACGC 3' 

Second primer: 
5' ACGTCGTTCAGTGGGACATTAAAAGGCTC 3' 

SNP TSC0418134 was amplified using the following primer set: 

r 

First primer: 

5' GGTTATGTGTCAGAATTCTGAAACTAGTTTGGAAGTAC 3' 

Second primer: 
5' GCCTCAGTfTCGGGGACAGTTCTGAGGAC 3' 

SNP TSC0603688 was amplified using the following primer set: 
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First primer: 

5 9 TGTAACACGGCCGAATTCCTCATTTGTATGAAATAGGT 3' 

Second primer: 
5' AATCTAACTTGAGGGACCGGCACACACAC 3> 

SNP TSC0129188 was amplified using the following primer set: 

First primer: 

5' AGTGTCCCCTTAGAATTCGCAGAGACACCACAGTGTGC 3> 

Second primer: 
y TTTGCTACAGTCGGGACCCTTGTGTGCTC 3' 

SNP TSC1 103570 was amplified using the following primer set: 

First primer: 

5' AGCACATCACTAGAATTCAATACCATGTGTGAGCTCAA 3' 

Second primer: 
5* AATCCTGCTTCCGGGACCTAACTTTGAAC 3' 

SNP TSC08 13449 was amplified using the following primer set: 

i 

First primer: 

5' TTTCATTTTCTGGAATTCCTCTAATGATTTTCTGGAGC 3' 
Second primer: 
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5' CGTCGCCGCGTAGGGACTTTTTCTTCCAC 3' 

SNP TSC0701940 was amplified using the following primer set: 
First primer: 

5' TTACTTAATCCTGAATTCGAGAAAAGCCATGTTGATAA 3' 

Second primer: 
5' TCATGGGTCGCTGGGACTTTGCCCTCTGC 3' 

SNP TSC0087962 was amplified using the following primer set: 

First primer: 

5' ACTAACAGCACTGAATTCATTTTACTATAATCTGCTAC 3' 

Second primer: 
5' GTTAGCCGAGAAGGGACTGTCTGTGAAGC 3' 

SNP TSC0660274 was amplified using the following primer set: 

First primer: 

5' AAATATGCAGCGGAATTCGTAAGTGACCTATTAATAAC 3' 

Second primer: 
5' GCGATGGTTACGGGGACAGCCAGGCAACC 3' 

Each first primer had a biotin tag at the 5' end and contained a restriction enzyme 
recognition site for EcoRI, and was designed to anneal at a specified distance from the locus of 
interest. This allows a single reaction to be performed for the loci of interest, as each loci of interest 
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will migrate at a distinct position (based on annealing position of first primer). The second primer 

contained a restriction enzyme recognition site for BsmF L 

All loci of interest were amplified from the multiplexed template DNA using the 

polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by 
5 reference). In this example, the loci of interest were amplified in separate reaction tubes but they 

could also be amplified together in a single PCR reaction. For increased specificity, a "hot-start" 

PGR was used. PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by 

QIAGEN (catalog number 203443). 

The amount of multiplexed template DNA and primer per reaction can be optimized for 
10 each locus of interest. One microliter of the multiplexed template DNA eluted from the MinElute 

column was used in the PCR reaction for each locus of interest, and 5 ^M of each primer was used. 

The twenty-nine SNPs described above also were amplified from the maternal DNA (15 ng of DNA 

was used in the PCR reaction; primer concentrations were as stated above). Forty cycles of PCR 

were performed. The following PCR conditions were used: 
15 (1) 95°C for 15 minutes and 15 seconds; 

(2) 37°C for 30 seconds; 

(3) 95°Cfor30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°Cfor30 seconds; 
20 (6) 64°C for 30 seconds; 

(7) 95°Cfor30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature was about the melting temperature of 
25 the 3' annealing region of the second primers, which was 37°C. The annealing temperature in the 
second cycle of PCR was about the melting temperature of the 3* region, which anneals to the 
template DNA, of the first primer, which was 57°C. The annealing temperature in the third cycle of 
PCR was about the melting temperature of the entire sequence of the second primer, which was 
64°C. The annealing temperature for the remaining cycles was 64°C. Escalating the annealing 
30 temperature from TM1 to TM2 to TM3 in the first three cycles of PCR greatly improves specificity. 
These annealing temperatures are representative, and the skilled artisan will understand the 
annealing temperatures for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, can be optimized by 
trying various settings and using the parameters that yield the best results. In this example, the first 
35 primer was designed to anneal at various distances from the locus of interest. The skilled artisan 
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understands that the annealing location of the first primer can be 5-10, 1 1-15, 16-20, 21-25, 26-30, 
31-35, 36-40, 41-45, 46-50, 51-55, 56-60, 61-65, 66-70, 71-75, 76-80, 81-85, 86-90, 91-95, 96-100, 
101-105, 106-110, 111-115, 116-120, 121-125, 126-130, 131-140, 140-160, 160-180, 180-200, 
200-220, 220-240, 240-260. 260-280. 280-300, 300-350, 350-400, 400-450, 450-500, or greater 
5 than 500 bases from the locus of interest. 

Purification of Fragment of Interest 

The PCR products were separated from the genomic template DNA. Each PCR product 
was placed into a well of a Streptawell, transparent, High-Bind plate from Roche Diagnostics 
GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 Biochemicals 

10 Catalog). Alternatively, the PCR products can be pooled into a single well because the first primer 
was designed to allow the loci of interest to separate based on molecular weight. The first primers 
contained a 5' biotin tag so the PCR products bound to the Streptavidin coated wells while the 
genomic template DNA did not. The streptavidin binding reaction was performed using a 
Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each well was aspirated to remove 

15 unbound material, and washed three times with IX PBS, with gentle mixing (Kandpal et a!., Nucl. 
Acids Res. 18:1789-1795 (1990); Kaneoka et al, Biotechniques 10:30-34 (1991); Green et al., 
Nucl. Acids Res. 18:6163-6164(1990)). 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
20 to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
After digestion, the wells were washed three times with PBS to remove the cleaved fragments. 

Incorporation of Labeled Nucleotide 

The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5 f overhang, 
25 which contained the SNP site or locus of interest and a 3* recessed end. The 5* overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. 

As demonstrated in Example 6, the sequence of both alleles of a SNP can be determined by 
fdling in the overhang with one labeled nucleotide in the presence of the other unlabeled 
30 nucleotides. The following components were added to each fill in reaction: 1 |xl of fluorescently 
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labeled ddGTP, 0.5 |xl of unlabeled ddNTPs ( 40 jiM), which contained all nucleotides except 
guanine, 2 \xl of 10X sequenase buffer, 0.25 \il of Sequenase, and water as needed for a 20jil 
reaction. The fill in reaction was performed at 40°C for 10 min. Non-fluorescently labeled ddNTP 
was purchased from Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained 
from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 |*1) three times. The "filled 
in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 

Detection of the Locus of Interest 

After release from the streptavidin matrix, the sample was loaded into a lane of a 36 cm 5% 
acrylamide (urea) gel (BioWhittaker Molecular Applications, Long Ranger Run Gel Packs, catalog 
number 50691). The sample was electrophoresed into the gel at 3000 volts for 3 min. The gel was 
run for 3 hours on a sequencing apparatus (Hoefer SQ3 Sequencer). The gel was removed from the 
apparatus and scanned on the Typhoon 9400 Variable Mode Imager. The incorporated labeled 
nucleotide was detected by fluorescence. 

Below a schematic of the 5' overhang for SNP TSC0838335 is depicted. The entire 
sequence is not reproduced, only a portion to depict the overhang (where R indicates the variable 
site). 

10/14 5' TAA 

3' ATT R A C A 

Overhang position 12 3 4 

The observed nucleotides for TSC0838335 are adenine and guanine on the 5' sense strand 
(herein depicted as the top strand). The nucleotide in position three of the overhang corresponded 
to cytosine, which is complementary to guanine. Labeled ddGTP can be used to determine the 
sequence of both allele in the presence of unlabeled dATP, dCTP, and dTTP. 

The restriction enzyme BsmF I was used to create the 5' overhang, which typically cuts 
10/14 from the recognition site. At times, BsmF I will cut 1 1/15 from the recognition site and 
generate the following overhang: 

11/15 5'TA 
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3' AT T R A C 

Overhang position 0 12 3 

Position 0 in the overhang is thymidine, which is complementary to adenine. Position 0 
complementary to the overhang was filled in with unlabeled dATP, and thus after the fill-in 
reaction, the exact same molecules were generated whether the enzyme cut at 10/14 or 1 1/15 from 
the recognition site. The DNA molecules generated after the fill-in reaction are depicted below: 



10 



G allele 10/14 



Overhang position 
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A allele 10/14 
Overhang position 
A allele 11/15 
Overhang position 



5'TAAA 
3'ATTT 



5'TA 
3* AT 



T 
A 
1 

A 
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G* 

C 
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A 
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T 
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25 The maternal template DNA amplified for TSC0838335 displayed a single band mat 

migrated at the expected position of the higher molecular weight band, which corresponded to the 
"A" allele (see FIG. 20, lane 1). The maternal template DNA was homozygous for adenine at SNP 
TSC0838335. 

However, in lane 2, amplification of the multiplexed template DNA for TSC0838335 
30 isolated from the plasma of the same individual displayed two bands; a lower molecular weight 
band, which corresponded to the "G" allele, and the higher molecular weight band, which 
corresponded to the "A" allele. The template DNA isolated from the plasma of a pregnant female 
contains both maternal template DNA and fetal template DNA. 

As seen in FIG. 20, lane 1, the maternal template DNA was homozygous for adenine at this 
35 SNP (compare lanes 1 and 2). The "G" allele represented the fetal DNA. Signals from the 
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maternal template DNA and the fetal template DNA clearly have been distinguished. The "G" 
allele becomes a beacon for the fetal DNA and can be used to measure the amount of fetal DNA 
present in the sample. Additionally, once the percentage of fetal DNA in the maternal plasma for a 
given sample has been determined, any deviation from this percentage indicates a chromosomal 
5 abnormality. This method provides the first non-invasive method for the detection of fetal 
chromosomal abnormalities. 

As seen in FIG. 20, lane 3, analysis of the maternal DNA for SNP TSC0418134 generated a 
single band that migrated at the expected position of the higher molecular weight band, which 
corresponded to the adenine allele. Likewise, analysis of the multiplexed template DNA isolated 
10 from the maternal plasma gave a single band, which migrated at the expected position of the 
adenine allele (see FIG. 20, lane 4). Both the maternal DNA and the fetal DNA are homozygous 
for adenine at TSC0418134. 

Below, a schematic of the 5' overhang for TSC0129188 is depicted, wherein R indicates 
the variable site: 

15 

10/14 5'TCAT 

3'AGTA R A C T 
Overhang position 12 3 4 

20 The nucleotide upstream of the variable site (R) does not correspond to guanine on the 

sense strand. Thus, the 5' overhang generated by the 11/15 cutting properties of BsmF I will be 
filled-in identically to the 5'overhang generated by the 10/14 cut Labeled ddGTP in the presence 
of unlabeled dATP, dTTP, and dCTP was used for the fill-in reaction. The DNA molecules 
generated after the fill-in reaction are depicted below: 

25 

A allele 10/14 5'TCAT A T G* 

3'AGTA TACT 
Overhang position 12 3 4 

30 G allele 10/14 5'TCAT G* 

3'AGTA C A C T 
Overhang position 12 3 4 

Analysis of the maternal DNA for SNP TSC0129188 gave a single band that corresponded 
35 to the DNA molecules filled in with ddGTP at position 1 complementary to the overhang, which 
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represented the "G" allele (see FIG. 20, lane 5). No band was detected for adenine allele, indicating 
that the maternal DNA is homozygous for guanine. 

In contrast, analysis of the multiplexed template DNA from the maternal plasma, which 
contains both maternal DNA, and fetal DNA, gave two distinct bands (see FIG. 20, lane 6). The 
5 lower molecular weight band corresponded to the "G" allele, while the higher molecular weight 
corresponded to the "A" allele. The "A" allele represents the fetal DNA. Thus, a method has been 
developed that allows separation of maternal DNA and fetal DNA signals without the added 
complexity of having to isolate fetal cells. In addition, a sample of paternal DNA is not required to 
detect differences between the maternal DNA and the fetal DNA. 

10 Analysis of the maternal DNA for SNP TSC0501389 gave a single band that migrated at 

the higher molecular weight position, which corresponded to the "A" allele. No band was detected 
that corresponded to the "G" allele. Similarly, analysis of the multiplexed template DNA from the 
maternal plasma for SNP TSC0501389 gave a single band that migrated at the higher molecular 
weight position, which corresponded to the "A" allele. Both the maternal template DNA and the 

15 fetal template DNA were homozygous for adenine at SNP TSC0501389. 

The maternal DNA and the template DNA from the plasma originated from the same 
sample. One sample, which was obtained through a non-invasive procedure, provided a genetic 
fingerprint for both the mother and the fetus. 

Of the twenty-nine SNPs for which the maternal template DNA was homozygous, the fetal 

20 template DNA was heterozygous at two of the twenty-nine SNPs. The fetal DNA was homozygous 
for the same allele as the maternal template DNA at the remaining 27 SNPs (data not shown). 
Comparing the homozygous allele of the maternal template DNA and the plasma template DMA at 
a given SNP provides an added level of quality control. It is not possible that the maternal template 
DNA and the plasma template DNA are homozygous for different alleles at the same SNP. If this 

25 is seen, it would indicate that an error in processing had occurred. 

The methods described herein demonstrate that the maternal genetic signal can be separated 
and distinguished from the fetal genetic signal in a maternal plasma sample. The above-example 
analyzed SNPs located on chromosome 13, however any chromosome can be analyzed including 
human chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X and 

30 Y and fetal chromosomes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X 
andY. 

In addition, the methods described herein can be used to detect fetal DNA in any biological 
sample including but not limited to cell, tissue, blood, serum, plasma, saliva, urine, tears, vaginal 
secretions, umbilical cord blood, chorionic villi, amniotic fluid, embryonic tissues, lymph fluid, 
35 cerebrospinal fluid, mucosa secretions, peritoneal fluid, ascitic fluid, fecal matter, or body exudates. 
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The methods described herein demonstrate that the percentage of fetal DNA in the maternal 
sample can be determined by analyzing SNPs wherein the maternal DNA is homozygous, and the 
DNA isolated from the plasma of the pregnant female is heterozygous. The percentage of fetal 
DNA can be used to determine if the fetal genotype has any chromosomal disorders. 

For example, if the percentage of fetal DNA present in the sample is calculated to be 30% 
by analysis of chromosome 1 (chromosomal abnormalities involving chromosome 1 terminate early 
in the pregnancy), then any deviation from 30% fetal DNA is indicative of a chromosomal 
abnormality. For example, if upon analysis of a SNP or multiple SNPs on chromosome 18, the 
percentage of fetal DNA is higher than 30%, this would indicate that an additional copy of 
chromosome 18 is present. The calculated percentage of fetal DNA from any chromosome can be 
compared to any other chromosome. In particular, the percentage of fetal DNA on chromosome 13 
can be compared to the percentage of fetal DNA on chromosomes 1 8 and 21 . 

This analysis is assisted by knowledge of the expected ratio of one allele to the other allele 
at each SNP. As discussed in Example 9, not all heterozygous SNPs display ratios of 50:50. 
Knowledge of the expected ratio of one allele to the other reduces the overall number of variable 
sites that must be analyzed. However, even without knowledge of the expected ratios for the 
various SNPs, the percentage of fetal DNA can be calculated by analyzing a large number of SNPs. 
When the sampling size of SNPs is large enough, the statistical variation arising from the values of 
the expected ratios will be eliminated. 

In addition, heterozygous maternal SNPs also provide valuable information. The analysis 
is not limited to homozygous maternal SNPs. For example, if at a heterozygous SNP on maternal 
DNA, the ratio of allele 1 to allele 2 is 1:1, then in the plasma template DNA the ratio should remain 
1:1 unless the fetal DNA carries a chromosomal abnormality. 

The above methods can also be used to detect mutations in the fetal DNA including but not 
limited to point mutations, transitions, transversions, translocations, insertions, deletions, and 
duplications. As seen in FIG. 20, fetal DNA can readily be distinguished from maternal DNA. The 
above methods can be used to determine the sequence of any locus of interest for any gene. 



EXAMPLE 14 

Plasma isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. As discussed above, fetal chromosomal abnormalities can be determined by 
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analyzing SNPs wherein the maternal template DNA is homozygous and the template DNA 
obtained from the plasma displays a heterozygous pattern. 

For example, assume SNP X can either be adenine or guanine, and the maternal DNA for 
SNP X is homozygous for guanine. The labeling method described in Example 6 can be used to 
5 determine the sequence of the DNA in the plasma sample. If the plasma sample contains fetal 
DNA, which is heterozygous at SNP X, the following DNA molecules are expected after digestion 
with the type IIS restriction enzyme BsmF I, and the fill-in reaction with labeled ddGTP, unlabeled 
dATP, dTTP, and dCTP. 

10 Maternal Allele 1 5' GGGT G* 

3'CCCA C T C A 

Maternal Allele 2 5' GGGT G* 

3'CCCA C T C A 

15 

Fetal Allele 1 5' GGGT G* 

3'CCCA C T C A 

Fetal Allele 2 5' GGGT A A G* 

20 3'CCCA T T C A 

Two signals are seen; one signal corresponds to the DNA molecules filled in with ddGTP at 
position one complementary to the overhang and the second signal corresponds to the DNA 
molecules filled in with ddGTP at position three complementary to the overhang. However, the 

25 maternal DNA is homozygous for guanine, which corresponds to the DNA molecules filled in at 
position one complementaiy to the overhang. The signal from the DNA molecules filled in with 
ddGTP at position three complementary to the overhang corresponds to the adenine allele, which 
represents the fetal DNA. This signal becomes a beacon for the fetal DNA, and can used to 
measure the amount of fetal DNA present in the plasma sample. 

30 There is no difference in the amount of fetal DNA from one chromosome to another. For 

instance, the percentage of fetal DNA in any given individual from chromosome 1 is the same as 
the percentage of fetal DNA from chromosome 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, X and Y. Thus, the allele ratio calculated for SNPs on one chromosome can be 
compared to the allele ratio for the SNPs on another chromosome. 
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For example, the allele ratio for the SNPs on chromosome 1 should be equal to the allele 
ratio for the SNPs on chromosomes 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, X, and Y. However, if the fetus has a chromosomal abnormality, including but not limited 
to a trisomy or monosomy, the ratio for the chromosome that is present in an abnormal copy 
5 number will differ from the ratio for the other chromosomes. 

To recapitulate the in vivo scenario of blood from a pregnant female, maternal DNA was 
mixed with DNA isolated from her child, who previously was diagnosed with Trisomy 21, in 
various ratios to represent varying percentages of fetal DNA. For example, to replicate the in vivo 
scenario of 50% fetal DNA in maternal blood, equal amounts of maternal DNA were mixed with 
10 DNA isolated from her child with Down's syndrome. The maternal DNA was analyzed to identify 
homozygous SNPs, and these SNPs then were analyzed using the mixture of 50% maternal DNA 
and 50% Down's syndrome DNA. The ratio of allele 1 to allele 2 at heterozygous SNPs on 
chromosome 13 was compared to the ratio of allele 1 to allele 2 at heterozygous SNPs on 
chromosome 21. 

15 Four different samples were analyzed: a sample with 100% of the DNA from a child with 

Down syndrome; a sample with 75% DNA from the child with Down syndrome and 25% DNA 
from the child's mother; a sample with 50% DNA from the child with Down syndrome and 50% 
DNA from the child's mother; and a sample with 40% DNA from the child with Down syndrome 
and 60% DNA from the child's mother. The maternal DNA was analyzed to identify homozygous 

20 SNPs. The DNA isolated from the child with Down syndrome was genotyped to identify 
heterozygous SNPs. Then, the samples were genotyped at SNPs where the maternal DNA was 
homozygous and the DNA from the child was heterozygous. For each sample, these SNPs were 
analyzed ten times. 

25 Collection of Blood Samples 

An Internal Review Board approved study was designed to allow collection of blood 
samples from children afflicted with Down's syndrome and their parents. For this study, blood was 
collected from the mother, the father, and the child with Down's syndrome.' Informed consent to 
collect blood from the child with Down's syndrome was granted by the parents as well as the child. 
30 Blood was collected into 9 ml EDTA Vacuette tubes (catalog number NC9897284). The tubes 
were stored at 4°C until ready for processing. 

Isolation of Plasma and Maternal Cells 

The blood was stored at 4°C until processing. The tubes were spun at 1000 rpm for ten 
35 minutes in a centrifuge with braking power set to zero. The tubes were spun a second time at 1000 
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rpm for ten minutes. The supernatant (the plasma) of each sample was transferred to a new tube 
and spun at 3000 rpm for ten minutes with the brake set to zero. The supernatant was transferred to 
a new tube and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains 
maternal cells, was placed into a separate tube and stored at -80°C. 

5 

Isolation of DNA 

DNA was isolated from the plasma sample using the Qiagen Midi Kit for purification of 
DNA from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, 
Catalog number 51 183). DNA was eluted in 100 \xl of distilled water. The Qiagen Midi Kit also 
10 was used to isolate DNA from the maternal cells contained in the "bufly coat." Maternal DNA and 
the plasma DNA were isolated from the same tube of blood. 

Identification of Maternal Homozygous SNPs 

The maternal DNA was genotyped to identify homozygous SNPs. Seven hundred and 

15 sixty-eight SNPs on chromosome 13 and 768 SNPs on chromosome 21 were genotyped using the 
methods described in Example 6. Any number of SNPs can be analyzed, and the SNPs can be 
located on human chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, X and Y. Preferably, the SNPs that are genotyped have allele frequencies of 50:50, 60:40, 
70:30, 80:20, or 90: 10. As described in Example 8, the allele frequency of any given SNP can be 

20 determined. 

Details regarding the SNPs located on chromosome 13 and 21 can be found at the SNP 
consortium database, which can be accessed via the internet at http://www.snpxshl.org . The 
primers were designed following the procedures set fourth in the Examples described above, for 
example, in Examples 1, 2, 3, 5, and 6. 

25 

The first primers were designed so that after digestion with a Type lis enzyme, the products 
had different molecular weights as described in Example 6. This allowed the amplified products to 
be pooled, and run in a single lane of a gel. 

For example, the first primer can be designed such that after digestion a 30 base pair 
30 product is generated. Likewise, the first primer of a different locus of interest can be designed such 
that after digestion a 40 base pair product is generated. The first primers can be designed so that in 
a single reaction, numerous loci can be analyzed in one lane of a gel (30, 40, 50, 60, 70, 80, 90, 100, 
110, and 120 base pair products can be run in a single lane). The first primer can be designed to 
anneal any distance from the locus of interest including but not limited to between 5-10, 10-25, 
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25-50, 50-75, 75-100, 100-150, 150-200, 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 
500-550, 550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000 and 
greater than 1000 bases. 

Amplification of the Loci of Interest 

For each SNP that was genotyped, a PCR reaction was used to amplify the loci of interest. 
The PCR reactions were performed in 96-well plates. The first and second primer (3 jllI of 1.25 \M 
stock concentration) for each SNP was distributed into a well of a microtiter plate. Eight 96-well 
PCR plates were set-up for chromosome 21 and eight 96-well plates were set-up for chromosome 
13. After the primers had been distributed into the wells of the microtiter plates, a mixture 
containing the genomic DNA and HotStar PCR reagents was added to each well. Each PCR 
reaction contained 3 p\ of each primer, 7.5 fil of HotStar Taq Master mix, 0.5 jitl of water, and 1 \xl 
of genomic DNA (10 ng/fil). 

The PCR cycling conditions were as follows: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 37°C for 30 seconds; 

(3) 95°C for 30 seconds; 

(4) 52°C for 30 seconds; 

(5) 95°C for 30 seconds; 

(6) 58°C for 30 seconds; 

(7) 95°C for 30 seconds; 

(3) Repeat steps 6 and 7 thirty seven (37) times; 
(9) 72°C for 5 minutes. 

Purification of Fragment of Interest 

After the PCR reaction, 3 jjJ of a PCR product generated with a first primer designed to 
produce a 30 base pair product, 3 \il of a PCR product generated with a first primer designed to 
produce a 40 base pair product, 3 jxl of a PCR product generated with a first primer designed to 
produce a 50 base pair product, 3 |d of a PCR product generated with a first primer designed to 
produce a 60 base pair product, 3 |JiI of a PCR product generated with a first primer designed to 
produce a 70 base pair product, 3 jllI of a PCR product generated with a first primer designed to 
produce a 80 base pair product, 3 y.1 of a PCR product generated with a first primer designed to 
produce a 90 base pair product, 3 ^1 of a PCR product generated with a first primer designed to 
produce a 100 base pair product were mixed together in a well of a Streptawell, transparent, High- 
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Bind plate from Roche Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular 
Biochemicals, 2001 Biochemicals Catalog). The first primers contained a 5 f biotin tag so the PCR 
products bound to the Streptavidin coated wells while the genomic template DNA did not The 
streptavidin binding reaction was performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 
5 min. at 37°C. Each well was aspirated to remove unbound material, and washed three times with 
IX PBS, with gentle mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al, 
Biotechniques 10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 

Restriction Enzyme Digestion of Isolated Fragments 

The purified PCR products were digested with the restriction enzyme BsmF I, which binds 
10 to the recognition site incorporated into the PCR products from the second primer. The digests 
were performed in the Streptawells following the instructions supplied with the restriction enzyme. 
After digestion, the wells were washed three times with PBS to remove the cleaved fragments. 

Incorporation of Labeled Nucleotide 

1 5 The restriction enzyme digest with BsmF I yielded a DNA fragment with a 5 ! overhang, 

which contained the SNP site or locus of interest and a 3' recessed end. The 5 f overhang functioned 
as a template allowing incorporation of a nucleotide or nucleotides in the presence of a DNA 
polymerase. As discussed in detail in Example 6, a single nucleotide labeled with one chemical 
moiety can be used to determine the sequence at a SNP. 

20 The amplified loci of interest were pooled into the streptavidin-well based on size, and on 

the nucleotide used in the fill-in reaction. The sequence of SNPs that were determined by using a 
guanine nucleotide were pooled together. Likewise, the sequence of SNPs that were determined by 
using an adenine nucleotide were pooled together; the sequence of SNPs that were determined by 
using a thymidine nucleotide were pooled together; and the sequence of SNPs that were determined 

25 by using a cytosine nucleotide were pooled together. 

Thus, a typical fill-in reaction contained 8 amplified loci, ranging in size of 30-120 base 
pair products; the sequence of all eight was determined using a single nucleotide labeled with one 
chemical moiety. Any number of amplified loci can be pooled together. 

The following components were added to each fill in reaction: 1 |xl of fluorescently labeled 

30 dideoxynucleotide (ddGTP for G fill-in reactions; ddATP for A fill-in reactions; ddTTP for 

thymidine fill-in reactions; and ddCTP for cytosine fill-in reactions), 0.5 \il of unlabeled dNTPs ( 
40 fiM), which contained all nucleotides except the labeled nucleotide, 2 |xl of 10X sequenase 
buffer, 0.25 fil of Sequenase, and water as needed for a 20fil reaction. 
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The fill in reaction was performed at 40°C for 10 min. Non-fluorescently labeled dNTP 
was purchased from Fermentas Inc. (Hanover, MD). All other labeling reagents were obtained 
from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). 

After labeling, each Streptawell was rinsed with IX PBS (100 nl) three times. The "filled 
5 in" DNA fragments were then released from the Streptawells by digestion with the restriction 
enzyme EcoRI, according to the manufacturer's instructions that were supplied with the enzyme. 
Digestion was performed for 1 hour at 37 °C with shaking at 120 rpm. 



Detection of the Locus of Interest 

10 After release from the streptavidin matrix, the sample was loaded into a lane of a 36 cm 5% 

acrylamide (urea) gel (BioWhittaker Molecular Applications, Long Ranger Run Gel Packs, catalog 
number 50691). The sample was electrophoresed into the gel at 3000 volts for 3 min. The gel was 
run for 3 hours on a sequencing apparatus (Hoefer SQ3 Sequencer). The gel was removed from the 
apparatus and scanned on the Typhoon 9400 Variable Mode Imager. The incorporated labeled 

15 nucleotide was detected by fluorescence. The homozygous SNPs were identified. 



Identification of Heterozygous SNPs with the Trisomy 21 template 

The DNA isolated from the individual with Down syndrome (the child of the mother who 
was genotyped above) was analyzed to identify heterozygous SNPs. The same seven hundred and 
20 sixty-eight SNPs on chromosome 13 and the same 768 SNPs on chromosome 2 1 that were analyzed 
with the maternal DNA were genotyped using the methods described in Example 6. Any number of 
SNPs can be analyzed, and the SNPs can be located on human chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 
10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X or Y. Preferably, the SNPs that are genotyped 
have allele frequencies of 50:50, 60:40, 70:30, 80:20, or 90:10. As described in Example 8, the 
25 allele frequency of any given SNP can be determined. 

The process for genotyping the SNPS with the DNA isolated from the individual with 
Down syndrome was as described for the maternal DNA. The heterozygous SNPs were identified. 

SNPs that were homozygous for the maternal DNA and heterozygous for the DNA isolated 
from the individual with Down syndrome were further analyzed using samples that contained 
3 0 mixtures of maternal DNA and Down syndrome DNA. 



Generation of Samples containing maternal DNA and Down syndrome DNA 

The DNA and the DNA obtained from her child, who has Down's syndrome, were 
quantitated using a spectrophotometer. The maternal DNA and the child's DNA were mixed 
35 together at various percentages to represent the situation of circulating fetal DNA in the maternal 
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blood. The following percentages were analyzed: 100% Down's syndrome DNA, 75% Down's 
syndrome DNA, 50% Down's syndrome DNA, and 40% Down's syndrome DNA. 

The ratio at each hetero2ygous SNP was calculated by dividing the value obtained for allele 
1 by the value obtained for allele 2. For example, if SNP X can either be adenine (A) or guanine 
5 (G), the ratio at SNP X was calculated by dividing the value obtained for adenine by the value 
obtained for guanine. 

For the sample containing 100% Down syndrome DNA, sixty-two SNPs on chromosome 
13, which were homozygous with the maternal DNA and heterozygous with the DNA isolated from 
the individual with Down syndrome, were analyzed. For chromosome 21, forty-nine SNPs were 

10 analyzed that were homozygous with the maternal DNA and heterozygous with the DNA isolated 
from the individual with Down syndrome. 

The 62 SNPs on chromosome 13 and 49 SNPs on chromosome 21 were analyzed ten 
separate times. As shown in Table XX, for each of the ten trials, the ratio of allele 1 to allele 2 on 
chromosome 13 was approximately 1.0 as expected. For chromosome 13, there is one copy of 

15 allele 1 and one copy of allele 2. The average of the ten trials was 1.051 with a standard deviation 
of 0.085. 

With a Trisomy 21, there are two copies of one allele, which are usually inherited from the 
mother, and one copy of the other allele. The expected ratio is approximately 0.5 (one copy of 
allele 1/ two copies of allele 2). As shown in Table XX, the ratio for chromosome 21 varied from a 

20 low of 0.462 to a high of 0.634. For every trial, the ratio obtained for chromosome 21 was 

significantly distinct from the ratio obtained at chromosome 13. The average ratio for the ten trials 
was 0.53 1 with a standard deviation of 0.049. 

The experiment was repeated ten times so that a true statistical measurement could be 
obtained. If ten different genetic samples were used, the SNPs that fit the criteria (maternal 

25 homozygous, Down syndrome child heterozygous) would be different, making it difficult to 
compare from sample to sample. 

Statistical analysis revealed a confidence value of 99.9% that the ratios obtained on 
chromosome 13 and on chromosome 21 represented true differences, rather than random numerical 
fluctuations in value. The Ravgen method identified the presence of the chromosomal abnormality. 

30 For the sample containing 75% Down syndrome DNA and 25% maternal DNA, sixty two 

SNPS on chromosome 13 and fifty SNPs on chromosome 21 were analyzed, unless stated 
otherwise. For various trials, not all the SNPS could be quantitated because the bands 
corresponding to certain SNPs were faint. This may have been caused by poor PCR amplification, 
poor binding to the streptavidin plate, or a weak fill-in reaction. 
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For trial 3, 61 SNPs on chromosome 13 were analyzed. For trail 4, 49 SNPs were analyzed 
on chromosome 21. With regard to trial 5, 47 SNPs on chromosome 21 were analyzed and 61 
SNPs on chromosome 13. For trial 7, 49 SNPs were analyzed on chromosome 21 and 61 SNPs on 
chromosome 13. For trial 8, 49 chromosomes were analyzed on chromosome 21, and 59 SNPs 
5 were analyzed on chromosome 13. For trials 9 and 10, 59 SNPs on chromosome 13 were analyzed. 

The expected ratio on chromosome 13 for a heterozygous SNP is 0.6. If the maternal 
chromosomes both contain an adenine nucleotide, and the Down syndrome genome is comprised of 
one chromosome with an adenine nucleotide and one chromosome with a guanine nucleotide, then 
the ratio of G:A is 0.75/ (.75 (Down syndrome A allele ) +0 .25 + 0.25 (maternal A alleles)), which 
10 is 0.6. For the ten trials, the ratios obtained for chromosome 13 varied from 0.567 to 0.645. The 
average for the ten trials was 0.609 with a standard deviation of 0.032 (see Table XX). 

The expected ratio for chromosome 21 in a Trisomy condition is 0.375. If the maternal 
chromosomes both contain an adenine nucleotide, and the Down syndrome genome is comprised of 
two chromosomes with an adenine nucleotide and one chromosome with a guanine nucleotide, then 
15 the ratio of G:A is 0.75/(0.75 + 0.75 (Down syndrome A alleles) + 0.25 + 0.25 (maternal A 
alleles)), which is 0.375. 

For the ten trials, the ratios obtained for chromosome 21 varied from 0.350 to 0.4125, with 
an average of 0.384 and a standard deviation of 0.017 (see Table XX). Statistical analysis revealed 
a confidence value of 99.9% that the ratios obtained on chromosome 13 and on chromosome 21 
20 represented true differences, rather than random numerical fluctuations in value. The Ravgen 
method identified the presence of the chromosomal abnormality in the presence of 25% maternal 
DNA. 

With regard to the sample containing 50% Down syndrome DNA, 46 SNPs on 
chromosome 13 and 35 SNPs on chromosome 21 were analyzed, unless stated otherwise. For trial 

25 1, 45 SNPs on chromosome 13 were analyzed. For trail 2, 44 SNPs on chromosome 13 were 
analyzed. For trial 3, 42 SNPs on chromosome 13 were analyzed. For trial 4, 44 SNPs on 
chromosome 13 and 34 SNPs on chromosome 21 were analyzed. For trial 5, 34 SNPs on 
chromosome 21 were analyzed. For trials 7 and 8, 44 and 41 SNPs on chromosome 13, 
respectively, were analyzed. For trial 9, 44 SNPs on chromosome 13 and 34 SNPs on chromosome 

30 21 were analyzed. For trial 10, 44 SNPs on chromosome 13 were analyzed. 

The expected ratio at a heterozygous SNP on chromosome 13 for the 50% sample is 0.33. 
If the maternal chromosomes both contain an adenine nucleotide, and the Down syndrome genome 
is comprised of one chromosome with an adenine nucleotide and one chromosome with a guanine 
nucleotide, then the ratio of G:A is 0.50/ (.50 (Down syndrome A allele ) +0 .50 + 0.50 (maternal A 

35 alleles)), which is 0.33. For the ten trials, the ratios obtained for chromosome 13 varied from 0.302 
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to 0.347. The average for the ten trials was 0.324 with a standard deviation of 0.013 (see Table 
XX). 

The expected ratio for chromosome 21 in a Trisomy condition is 0.25. If the maternal 
chromosomes both contain an adenine nucleotide, and the Down syndrome genome is comprised of 
5 two chromosomes with an adenine nucleotide and one chromosome with a guanine nucleotide, then 
the ratio of G:A is 0.50/(0.50 + 0.50 (Down syndrome A alleles) + 0.50 + 0.50 (maternal A 
alleles)), which is 0.25. 

For the ten trials, the ratios obtained for chromosome 21 varied from 0.230 to 0.275, with 
an average of 0.244 and a standard deviation of 0.015 (see Table XX). Statistical analysis revealed 
10 a confidence value of 99.1% that the ratios obtained on chromosome 13 and on chromosome 21 
represented true differences, rather than random numerical fluctuations in value. The Ravgen 
method identified the presence of the chromosomal abnormality in the presence of 50% maternal 
DNA. 

For the sample containing 40% Down syndrome DNA, 60 SNPs on chromosome 13 and 48 

15 SNPs on chromosome 21 were analyzed, unless stated otherwise. For trial '1, 47 SNPs on 
chromosome 21 were analyzed. For trials 2-4, 59 SNPs on chromosome 13 and 47 SNPs on 
chromosome 21 were analyzed. For trials 5 and 6, 46 SNPs on chromosome 21 were analyzed. For 
trail 7, 58 SNPs on chromosome 13 were analyzed. For trial 8, 46 SNPs on chromosome 21 were 
analyzed and for trials 9 and 10, 47 SNPs on chromosome 21 were analyzed. 

20 The expected ratio at a heterozygous SNP on chromosome 13 for the 40% Down syndrome 

DNA sample is 0.25. If the maternal chromosomes both contain an adenine nucleotide, and the 
Down syndrome genome is comprised of one chromosome with an adenine nucleotide and one 
chromosome with a guanine nucleotide, then the ratio of G:A is 0.40/ (.40 (Down syndrome A 
allele ) + 0.60 + 0.60 (maternal A alleles)), which is 0.25. For the ten trials, the ratios obtained for 

25 chromosome 13 varied from 0.254 to 0.285. The average for the ten trials was 0.269 with a 
standard deviation of 0.009 (See Table XX). 

The expected ratio for chromosome 21 in a Trisomy condition is 0.20. If the maternal 
chromosomes both contain an adenine nucleotide, and the Down syndrome genome is comprised of 
two chromosomes with an adenine nucleotide and one chromosome with a guanine nucleotide, then 

30 the ratio of G:A is 0.40/(0.40 + 0.40 (Down syndrome A alleles) + 0.60 + 0.60 (maternal A 
alleles)), which is 020. 

For the ten trials, the ratios obtained for chromosome 2 1 varied from 0.216 to 0.249, with 
an average of 0.23 and a standard deviation of 0.01 1 (see Table XX). Statistical analysis revealed a 
confidence value of 94.3% that the ratios obtained on chromosome 13 and on chromosome 21 

35 represented true differences, rather than random numerical fluctuations in value. The Ravgen 
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method identified the presence of the chromosomal abnormality in the presence of 60% maternal 
DNA. 

The presence of the Trisomy 21 condition was identified with the Ravgen method in 
numerous samples that contained various percentages of abnormal DNA. Each percentage of 
5 abnormal DNA was'analyzed ten separate times and each time, the presence of the abnormal 
condition was identified. The ratio of allele 1 to allele 2 at multiple heterozygous SNPs on 
chromosome 13 was calculated, and the ratios were averaged. The same was done with the SNPs 
located on chromosome 21. The ratio obtained for the heterozygous SNPs on chromosome 13 was 
statistically different from the ratio obtained on chromosome 21. The ratios obtained on both 
10 chromosome 13 and 21 were near the mathematically predicted values. 

In this example, the confidence interval for the samples with 100% Down syndrome DNA 
and 75% Down syndrome DNA was 99.9%, and the confidence interval for the sample with 50% 
Down syndrome DNA was 99. 1%, which is about the accuracy reported for amniocentesis. The 
confidence interval for the sample containing 40% Down syndrome DNA was 94.3%, which is 
15 more accurate than currently marketed non-invasive tests for prenatal diagnostics. 

As discussed above, about 60 SNPs on chromosome 13 and 50 SNPs on chromosome 21 
were analyzed. To increase the confidence interval for samples containing 40% fetal DNA or 
lower, a larger number of SNPs can be analyzed. The Ravgen method provides a highly accurate, 
cost-effective way to sequence DNA, so sequencing a larger number of SNPs is not difficult. The 
20 accuracy of the test is determined by the number of SNPs that are sequenced. For higher accuracy 
with samples that contain lower percentages of DNA, more SNPs can be analyzed. Alternatively, 
the methods described in this application can be used to ensure that the samples contain a higher 
percentage of fetal DNA. 

In this example, a sample containing 40% Down syndrome DNA, which represented the 
25 fetal DNA in the maternal blood, was analyzed. Maternal blood samples with any percentage of 
fetal DNA can be analyzed including but not limited to 0.0001-1%, 1-10%, 10-20%, 20-30%, 30- 
40%, 40-50%, 50-60%, 60-70%, 70-80%, 80-90%, and 90-100%. 

Table XX. The Ravgen method identifies chromosomal abnormalities in 
30 Samples containing 40% Down syndrome DNA 





100% DS DNA 


~75%DSDNA 


Ratio at 
Chrom. 


13 


21 


Expected 
21 


13 


21 


Expected 
21 


Trial 1 


0.959 


0.5708 


0.49 


0.637 


.3764 


0.389 
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Trial 2 


0.916 


0.5024 


0.48 


0.567 


.3894 


0.362 


Trial 3 


1029 


0.4616 


0.51 


0.651 


.3707 


0.394 


Trial 4 


0.967 


0.5123 


0.491 


0.580 


.3901 


0.367 


Trial 5 


1.037 


0.6339 


0.51 


0.645 


.4125 


0.392 


Trial 6 


1.111 


0.5425 


0.53 


0.645 


.3743 


0.392 


Trial 7 


1.154 


0.495 


0.54 


0.594 


.3974 


0.373 


Trial 8 


1.135 


0.5276 


0.532 


0.583 


.3901 


0.368 


Trial 9 


1.148 


0.5619 


0.534 


0.579 


.3899 


0.367 


Trial 10 


1.057 


0.4976 


0.52 


0.609 


.350 


0.378 


AVG. 




.531 


0312 


.6091 


0.384 


0.378 , 


STDEV 


,085 


•049 




>032i 






















~50%DSDNA 


~40%DSDNA 


Ratio at 
Chrom. 


13 


21 


Expected 
21 


13 


21 


Expected 
21 


Trial 1 


0.347 


0.275 


0.258 


0.277 


0.239 


0.217 


Trial 2 


0.316 


0.237 


0.24 


0.265 


0.249 


0.21 


Trial 3 


0.338 


0.247 


0.253 


0.266 


0.227 


0.21 1 


Trial 4 


0.331 


0.264 


0.249 


0.254 


0.216 


0.202 


Trial 5 


0.330 


0.241 


0.248 


0.274 


0.246 


0.215 


Trial 6 


0.324 


0.240 


0.244 


0.268 


0.22 


0.211 


Trial 7 


0.318 


0.233 


0.241 


0.275 


0.227 


0.216 


Trial 8 


0.302 


0.230 


0.231 


0.258 


0.228 


0.21 


Trial 9 


0.315 


0.238 


0.240 


0.285 


0.231 


0.222 


Trial 10 


0.318 


0.235 


0.241 


0.266 


0.218 


0.21 


AVG. 


0.324 


0.244 


0.244 


0269 


0.23 


0.212 


STDEV 


0.013 


0.015 




0.009 


0.011 





EXAMPLE 15 



As discussed in Example 4 above, the use of cell lysis inhibitors, cell membrane stabilizers, 
5 or cross-linking reagents can be used to increase the percentage of fetal DNA in the maternal blood. 
In this example, methods for the isolation of free fetal DNA are disclosed, which minimize the 
amount of maternal cell lysis. The effect of formalin on sixty-nine (69) maternal blood samples 
from twenty-seven clinical practices located in sixteen different states was analyzed. Formalin was 
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added to all samples collected from the pregnant women, and the percentage of fetal DNA was 
calculated using serial dilution analysis followed by PCR. A genetic marker on the Y chromosome 
was used to calculate the percent of fetal DNA. 

5 Collection of Blood Samples 

In accordance with an IRB approved study, blood samples were collected from pregnant 
women after informed consent had been granted. Blood samples were received from 27 different 
clinical sites operating in 16 different states located throughout the U.S. Blood samples were 
collected from both women carrying male and female fetuses, however, here, we report results 

10 obtained from woman carrying male fetuses, as the Y chromosome is the accepted marker when 
quantitating percentages of fetal DNA. 

Blood is collected by any method or process that results in a substantial increase in the ratio 
of fetal DNA/maternal DNA in the resulting serum or plasma after appropriate processing. As used 
herein, a substantial increase in the ratio of fetal DNA/mpternal DNA is that which can be detected 

15 by the methods as described herein. Such methods or processes typically result in a substantial 
increase in the ratio of fetal DNA/maternal DNA of about 5%, 10%, 15%, 20%, 30%, 50%, 70%, 
80%, 100% or more of the ratio of fetal DNA/maternal DNA found in blood samples collected by 
standard procedures. 

In other embodiments, blood is collected by any method or process that results in a 
20 substantial increase in the amount of free fetal DNA compared to the amount of total DNA 

recovered or detected in the resulting serum or plasma after processing. Such methods or processes 
typically result in a substantial increase so the fetal DNA recovered or detected is about 10%, 15%, 
20%, 25%, 30%, 40%, 50% or more of the total DNA recovered or detected in the processed 
plasma or serum sample. 

25 All clinical sites were provided with a kit used for the venipuncture procedure, which 

included 21 gauge needles, 9 ml EDTA Vacuette tubes (catalog number NC9897284) a syringe 
containing 0.225 ml of 10% neutral buffered solution containing formaldehyde (4% w/v), an 
icepack, and a shipping container. The clinical sites were instructed to add, the formaldehyde 
immediately after drawing the blood and to gently invert the tubes. 

30 The methods or processes of collecting blood samples may also include other steps that 

result in lessened or reduced cell lysis. For instance, blood collection devices may be modified to 
decrease cell lysis due to sheer forces in the collection needle, syringe or tubes used. For instance, 
needles of large gauge may be employed to reduce cell sheering or vacutainer tubes may be 
modified to reduce the velocity of blood flow. 

35 
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Isolation of Plasma 

Any method may be used to isolate plasma from the cell components of blood after 
collection but methods wherein cell lysis is substantially prevented, reduced or inhibited are 
preferred. The blood was stored at 4°C until processing. Methods for isolation of the plasma were 
5 implemented to reduce the amount of maternal cell lysis. The tubes were spun at 1000 rpm for ten 
minutes in a centrifuge with braking power and acceleration power set at zero to substantially 
prevent, reduce or inhibit cell lysis and or mixing of blood cell components into the plasma. The 
tubes were spun a second time at 1000 rpm for ten minutes with braking power (centrifuge stopped 
by natural deceleration) and acceleration power set to zero. The supernatant (the plasma) of each 

10 sample was transferred carefully to a n<ew tube and spun at 3000 rpm for ten minutes with the brake 
and acceleration power set at zero. The supernatant (the plasma) of each sample was collected via 
procedures to substantially prevent mixing of cell components into the plasma. Great care was 
taken to ensure that the buffy-coat was not disturbed. A percentage of the supernatant can be left in 
the tube including but not limited to 0.001-1%, 1-10%, 10-20%, 20-30%, 30-40%, 40-50%, 50- 

15 60%, 60-70%, 70-80% or greater than 80%. In this example, about 0.5 ml of the supernatant was 
left in the tube to ensure that the buffy-coat was not disturbed. The supernatant was transferred to a 
new tube and stored at -80°C. 

Isolation ofDNA 

20 DNA was isolated from the plasma sample using the Qiagen Midi Kit for purification of 

DNA from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, 
Catalog number 51 183). DNA was eluted in 100 pi of distilled water. However, any method of 
DNA isolation can be used including cesium chloride gradients, gradients, sucrose gradients, 
glucose gradients, centrifugation protocols, boiling, Qiagen purification systems, QIA DNA blood 

25 purification kit, HiSpeed Plasmid Maxi Kit, QIAfilter plasmid kit, Promega DNA purification 

systems, MangeSil Paramagnetic Particle based systems, Wizard S V technology, Wizard Genomic 
DNA purification kit, Amersham purification systems, GFX Genomic Blood DNA purification kit, 
Invitrogen Life Technologies Purification Systems, CONCERT purification system, Mo Bio 
Laboratories purification systems, UltraClean BloodSpin Kits, and UlraClean Blood DNA Kit. The 

30 skilled artisan understands that the manufacturer's protocols can modified to increase the yield of 
DNA. For example, the Qiagen Midi Kit for purification of DNA recommends the use of IX AL 
buffer. However, any concentration of AL buffer may be used if the yield of DNA increases 
including but not limited to 0.1-0.5X AL buffer, 0.5-1X AL buffer, 1X-2X AL buffer, 2-3X AL 
buffer, 3-4X AL buffer, 4-5X AL buffer, and greater than 5X AL buffer. The skilled artisan 

35 understands that the modifications and manipulations of the reagents are nojt limited to AL buffer. 
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Quantification of Percentage of Fetal DNA 

The percentage of fetal DNA present in the maternal plasma sample was calculated using 
serial dilution analysis followed by PCR. Two different sets of primers were used: one primer set 
was specific for the Y chromosome, and thus specific for fetal DNA, and the other primer set was 
designed to amplify the cystic fibrosis gene, which is present on both maternal template DNA and 
fetal template DNA. 

Primer Design: 

The following primers were designed to amplify the SRY gene on the Y chromosome: 

Upstream primer: 
5' TGGCGATTAAGTCAAATTCGC 3' 

Downstream primer: 
5* CCCCCTAGTACCCTGACAATGTATT 3' 

The following primers were designed to amplify the cystic fibrosis gene: 

Upstream primer: 
5' CTGTTCTGTGATATTATGTGTGGT 3' 

Downstream primer: 
5' AATTGTTGGCATTCCAGCATTG 3' 

PCR Reaction 

The SRY gene and the cystic fibrosis gene were amplified from the template genomic DNA 
using PCR (U.S. Patent Nos. 4,683,195 and 4,683,202). For increased specificity, a "hot-start" 
PCR was used. PCR reactions were performed using the HotStarTaq Master Mix Kit supplied by 
Qiagen (Catalog No. 203443). For amplification of the SRY gene, the DNA eluted from the 
Qiagen purification column was diluted serially 1 :2. For amplification of the cystic fibrosis gene, 
die DNA eluted from the Qiagen purification column was diluted 1:4, and then serially diluted 1:2. 
The following components were used for each PCR reaction: 8 jil of template DNA (diluted or 
undiluted), 1 jil of each primer (5 yM), 10 of HotStar Taq mix. The following PCR conditions 
were used: 
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(1) 95°Cforl5' 

(2) 94°Cforl 9 

(3) 54°Cforl5" 

(4) 72°Cfor30" 

5 (5) Repeat steps 2-4 for 45 cycles. 

(6) 10'at72°C 

Amplification of the SRY gene was performed using the following templates: undiluted, 
diluted 1:2, diluted 1:4, diluted 1:8, diluted 1:16, diluted 1:32, diluted 1:64, diluted 1:128, diluted 
10 1 :256, and diluted 1:512. Amplification of the cystic fibrosis gene was performed using the 

following templates: diluted 1:4, diluted 1:8, diluted 1:16, diluted 1:32, diluted 1:64, diluted 1:128, 
diluted 1:256, diluted 1:512, diluted 1:1024, diluted 1:2048, and diluted 1:4096. 

The percent of fetal DNA present in the maternal plasma was calculated using the 
following formula: 

15 % fetal DNA = (amount of SRY gene/amount of cystic fibrosis gene)*2* 1 00. 

The amount of SRY gene was represented by the highest dilution value in which the gene was 
amplified. Likewise, the amount of cystic fibrosis gene was represented by the highest dilution 
value in which it was amplified. The formula contains a multiplication factor of two (2), which is 

20 used to normalize for the fact that there is only one copy of the SRY gene (located on the Y 
chromosome), while there are two copies of the cystic fibrosis gene. 

The effect of formalin on sixty-nine (69) maternal blood samples collected from twenty- 
seven clinical practices located in sixteen different states, spanning from Washington to 
Massachusetts is shown in Table XXI. In this study, formalin was added to all samples collected 

25 from the pregnant women, and the percentage of fetal DNA was calculated using serial dilution 
analysis followed by PCR. The serial dilutions and PCR amplifications were performed by four 
different scientists over a period of five months. The samples were collected from women at 
gestational ages ranging from 1 1 weeks to 28 weeks, with the majority of women between 16-19 
weeks of gestation. A summary is provided in Table XXHI. 

30 The average percentage of free fetal DNA for the 69 samples analyzed in the maternal 

blood was 33.6%. Lo et ah reported fetal DNA concentrations of 3.4% in woman in late first to 
mid-second trimester, which was the gestational age of the majority of women in this study. Thus, 
the addition of formalin led to approximately a ten-fold increase in the average percentage of fetal 
DNA. 
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While the calculated percentage of fetal DNA in maternal blood is impressive, it is also 
informative to examine the range of the percentages of fetal DNA observed in this study. About six 
percent of the women (4/69) had 3.125% of free fetal DNA in the maternal blood, which was the 
lowest percentage of fetal DNA observed in this study. Another 10.2% of women had 6.25% fetal 
5 DNA, which represents a two-fold increase over the reported average in the literature. The total 
number of women who had less than 10% fetal DNA in the maternal blood was only 16.0%. 

Fifty-eight percent of the women in this study had a percentage of fetal DNA of 25% or 
greater. Importantly, 26.0% of the women had fifty percent or greater fetal DNA in the maternal 
blood. Fetal DNA percentages of this magnitude have not been reported, and represent a new tool 
10 to the field of prenatal genetics. 

There were four samples collected from women at the gestational age of eleven weeks. The 
percentages of fetal DNA in the maternal blood samples were as follows: two samples at 12.5%; 
one sample at 25%; and one sample at greater than 50%. Thus, the effect of formalin on the 
percentages of fetal DNA was observed with samples collected from women in early as well as later 
1 5 gestational periods. 

The effect of stabilizing cell membranes and reducing the release of free DNA was not 
limited to formalin. We have tested several different types of agents, and combinations of agents, 
that prevent cell lysis and/or stabilize cell membranes, such as glutaraldehyde, and have seen that 
these agents also reduce the amount of free DNA in the blood sample (data not shown). 
20 The above described methods may also include steps of adding an agent to the blood 

sample at the time or near to the time of collection to substantially inhibit or impede cell lysis or 
stabilize cell membranes. Any number of agents that impede cell lysis or stabilize cell membranes 
or cross-link cell membranes can be added to the maternal blood samples including but not limited 
to formaldehyde, and derivatives of formaldehyde, formalin, glutaraldehyde, and derivatives of 
25 glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, sulfhydryl reactive crosslinkers, 
sulfhydryl addition or disulfide reduction, carbohydrate reactive crosslinkers, carboxyl reactive 
crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, BM(PEO) 3 , 
BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, 
DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfo-DST, 
30 Sulfo-EGS or the compounds listed in Table XXHI. Additional cross-linkers that can be used are 
found at the following website: www.piercenet.com/products/ . 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 
reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
35 concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
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taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
5 biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, riedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 1 88. 

Any concentration of agent that stabilizes cell membranes, impedes cell lysis or cross-link 
cell membranes can be added. In a preferred embodiment, the agent that stabilizes cell membranes, 
1 0 impedes cell lysis, or cross-links cell membranes is added at a concentration that does not impede 
or hinder subsequent reactions. 

i 

While impressive percentages of free fetal DNA in maternal blood samples have been 
reported, it is thought that higher percentages can be achieved by carefully explaining the 
importance of the formalin to the physicians. Samples randomly were checked for the presence of 
15 formalin and found that about ten percent of the samples did not receive formalin. In addition, 

aggregates were observed in another ten percent of the samples suggesting that the formalin had not 
been thoroughly mixed with the collected blood. Thus, while the addition of formalin produced an 
impressive effect, it is likely that under controlled conditions, the percentage of free fetal DNA may 
be higher. 

20 In addition, we believe that procedures to minimize hemolysis during the venipuncture 

procedure and temperature controlled shipping containers (specimens were shipped in a Styrofoam 
container with ice pack, but there was variation in temperature because samples were shipped from 
varying distances) may cause a further increase in the percentage of free fetal DNA. Needles 
designed to reduce hemolysis can be used during the venipuncture procedure. 

25 Also, we hypothesized that procedures for carefully isolating the plasma would help to 

ensure a minimal amount of maternal DNA in the sample. We implemented procedures, as 
described above, to reduce cell lysis, such as gentle centrifugation parameters, and allowed the 
rotors to stop without external force (no brake). Also, we carefully removed the supernatant 
containing the plasma DNA from the buffy-coat, which contains maternal DNA. These procedures 

30 coupled with the addition of formalin to prevent cell lysis resulted in a tremendous increase in the 
percentage of fetal DNA. 

Table XXI. Formalin increases the percentage of free fetal DNA in blood samples collected at 
35 numerous clinical sites from women at various stages of gestation. 
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Sample 


Wks 
Gestation 


Fetal Genomes/ ml 


% Fetal DNA 


1 


16 


80 


25 


2 


19 


1066 


>50 


3 


17 


52 


50 


4 


22 


166 


25 


5 


32 


457 


50 


6 


19 


400 


100 


7 


18 


800 


100 


8 


17 


100 


50 


9 


16 


50 


25 


10 


17 


25 


12.5 


11 


16 


94.74 


12.5 


12 


16 


34.60 


50 


13 


16 


22.5 


25 


14 


17 


50 


12.5 


15 


17 


26.48 


12.5 


16 


17 


45.00 


25 


17 


17 


94.7 


100 


18 


17 


28.13 


6 25 


19 


19 


28.13 


25 


20 


20 


11 25 


12.5 


21 


15 


11.25 


12 5 


22 


11 


16.66 


12 5 


23 


18 


13.23 


25 


24 


18 


12.50 


6 25 


25 


16 


112 50 


100 


26 


17 


124.13 


25 


27 


14 


90.00 


50 


28 




100.00 


100 


29 


18 


232.00 


100 


30 


19 


626.00 


100 


31 


19 


112.50 


100 


32 


16 


423.50 


100 


33 


16 


423.50 


25 


34 


11 


105.88 


25 


35 


16 


49.60 


3.1 


36 


11 


11.84 


12 5 


37 


16 


120.00 


25 


38 


18 


34290 


100 


39 


17 


5L43 


25 


40 1 


18 


225.00 


6.25 


41 


17 


400.00 


12.5 


42 


28 


180.00 


25 


43 


17 


20.45 


12.5 


44 


18 


25.73 


25 


45 


16 


68.68 


3.1 


46 


17 


218.18 


25 


47 


15 


75.00 


6 25 


48 


16 


40.58 


3.1 


49 


17 


100.00 


25 n 


50 


17 


14.06 


12.5 


51 


22 


22.50 


12 5 


52 


15 


28.13 


12 5 


53 


17 


50.00 


3.125 


54 


18 


58.00 


50 


55 


14 


100.00 


25 


56 


16 


58.08 


25 


57 


16 


13.64 


12 5 


58 


16 


25.00 


6.25 


59 


20 


45.00 


25 


60 


16 


23.69 


12.5 


61 


18 


5.92 


6.25 


62 


15 


28.13 


6.25 


63 


17 


50.00 


25 


64 


16 


360.00 


50 


65 


16 


25.00 


12.5 


66 


16 


48.65 


25 


67 


16 


47.38 


12.5 


68 


14 


26.45 


50 


69 


17 


124.15 


25 


Average 


17 


131.15 


33.6 
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Table XXII. Formalin increases the percentage of free fetal DNA in blood samples collected at 
numerous clinical sites from women at various stages of gestation. 

5 



% Fetal DNA 


3.125 


6.25 


12.5 


25 


50 


Over 50% 


Number 
Women (69) 


4 


7 


18 1 


22 


7 


11 


% 


5.8 


10.1 


26.1 


31.9 


10.2 


15.9 



Table XXIII. A representative list of cross-linkers that can be used to impede maternal cell lysis. 



Cross-Linker 


Abbreviation 


succinimidyl acetylthioacetate 


SATA 


siuccinimidyl *raras-4-(maleimidylmethyl) 
cyclohexane-l-carboxylate 


SMCC 


succinimidyl 3-(2-pyridyldithio)propionate 


SPDP 


iV-((2-pyridyldithio)ethyl)-4-azidosalicylamide 


PEAS; AET 


4-azido-2,3,5,64etrafluorobenzoic acid, 
succinimidyl ester 


ATFB, SE 


4-azido-2,3,5,6-tetrafluorobenzoic acid, STP 
ester, sodium salt 


ATFB, STP ester 


4-azido-2,3,5,6-tetrafluorobenzyl amine, 
hydrochloride 




benzophenone-4-isothiocyanate 




benzophenone-4-maleimide 




4-benzoylbenzoic acid, succinimidyl ester 




Disuccinimidylsuberate 


DSS 


Dithiobis(succinimidylpropionate) 


DSP 


3,3'-Dithiobis(sulfosuccinimidylpropionate) 


DTSSP 


Bis[2- 

(sulfosuccinimdooxycarbonyloxy)ethyl]sulfone 


SULFO BSOCOES 


Bis[2- 

(succinimdooxycarbonyloxy)ethyl]sulfone 


BSOCOES 
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Disulfosuccinimdyltartrate 


SULFO DST 


Disuccinimdyltartrate 


DST 


Ethylene glycolbis(succinimidylsuccinate) 


SULFO EGS 


Ethylene glycolbis(sulfosuccinimidylsuccinate) 


EGS 


pyridyldithio)propionamido]butane 


DPDPB 


Bis(sulfosuccinimdyl)suberate 


BSSS 


Succinimdyl-4-(p-maleimidophenyl)butyrate 


SMPB 


Sulfosuccinimdyl-4-(p- 
maleimidophenyl)butyrate 


i 

SULFO SMPB 


3-Maleimidobenzoyl-N-hydroxysuccinimide 
ester 


MBS 


3-Maleimidobenzoyl-N- 
hydroxysulfosuccinimide ester 


SULFO MBS 


N-Succinimidyl(4-iodoacetyl)aminobenzoate 


SIAB 


N-Sulfosuccinimidyl(4- 
iodoacetyl)aminobenzoate 


SULFO SIAB 


Succinimidyl-4-(N- 

maleimidomethyl)cyclohexane-l-carboxylate 


SMCC 


Sulfosuccinimidyl-4-(N- 
maleimidomethyl)cyclohexane-l-carboxylate 


SULFO SMCC 


Succinimidyl-6-[3-(2- 
pyridyldithio)propionamido)hexanoate 


NHSLCSPDP 


Sulfosuccinimidyl-6-[3-(2- 
pyridyldithio)propionamido)hexanoate 


SULFO NHS LCSPDP 


N-Succinimdyl-3-(2-pyridyldithio)propionate 


SPDP 


N-Hydroxysuccinimidylbromoacetate 


NHS BROMOACETATE 


N-Hydroxysuccinimidyliodoacetate 


NHS IODOACETATE 


4-(N-Maleimidophenyl)butyric acid hydrazide 
hydrochloride 


MPBH 


4-(N-Maleimidomethy l)cyclohexane- 1 - 
carboxylic acid hydrazide hydrochloride 


MCCH 


m-Maleimidobenzoic acid 
hydrazidehydrochloride 


MBH 
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N-(epsilon- 

MaleimidocaproyIoxy)sulfosuccinimide 


SULFOEMCS 


N-(epsilon-Maleimidocaproyloxy)succinimide 


EMCS 


N-(p-Maleimidophenyl)isocyanate 


PMPI 


N-(kappa-Maleimidoundecanoic acid) 
hydrazide 


KMUH 


Succinimidyl-4-(N-maleimidomethyl)- 
cyclohexane- 1 -carboxy(6-amidocaproate) 


LC SMCC 


N-(gamma- 

Maleimidobutryloxy)sulfosuccinimide ester 


SULFOGMBS 


Succinimidyl-6-(beta- 
maleimidopropionamidohexanoate) 


SMPH 


N-(kappa- 

Mdeimidoundecanoyloxy)sulfosuccinimide 
ester 


SULFO KMUS 


N-(gamma-Maleimidobutyrloxy)succinimide 


GMBS 


Dimethyladipimidate hydrochloride 


DMA 


Dimethylpimelimidate hydrochloride 


DMP 


Dimethylsuberimidate hydrochloride 


DMS 


Methyl-p-hydroxybenzimidate hydrochloride, 
98% 


MHBH(Wood's Reagent) 


Amine Reactive 




2?w[sulfosuccinimidyl] suberate 


BS3 


Bis[2- 

(succinimidooxycarbonyloxy)ethyl]sulfone 


BSOCOES 


Disuccinimidyl glutarate 


DSG 




DSP (Lomant's Reagent) 


1 3 5-Difluoro-2,4-dinitrobenzene 


DFDNB 


Dithiobis[succinimidylpropionate 


DTBP 


Bis-[b-(4-Azidosalicylamido)ethyl]disulfide 


BASED 






Sulfhydryl Reactive 
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BM[PEO] 3 (l,8-bis-Maleimidotriethyleneglycol 


BM[PEO] 3 


BM[PEO] 4 (l,ll-bis- 
Maleimidotetraethyleneglycol 


BM[PEO] 4 


1 ,4-bis-Maleimidobutane 


BMB 


1,4 bis-Maleimidyl-2,3-dihydroxybutane 


BMDB 


Bis-Maleimidohexane 


BMH 


Bis-Maleimidoethane 


BMOE 


l,4-Di43'-(2'-pyridyldithio> 
propionamido]butane 


DPDPB 


Dithio-bis-raaleimidoethane 


DTME 


1,6-Hexane-bis-vinylsulfone 


HBVS 


p-Azidobenzoyl hydrazide 


ABH 


Amine-Sulfhydryl Reactive 




N-[a-Maleimidoacetoxy]succinimide ester 


AMAS 


JV-[4-(p-Azidosalicylamido) butyl]-3 '-(2'- 
pyridyldithio)propionamide 


APDP 


^-[fi-Maleimidopropyloxy]succinimide ester 


BMPS 


N-e-Maleimidocaproic acid 


EMCA 


N~e-Maleimidocaproyloxy]succinimide ester 


EMCS 


N-[g-Maleimidobutyiyloxy]succinimide ester 


GMBS 


N-k-Maleimidoundecanoic acid 


KMUA 


Succinimidyl-4-(N- , 
Maleimidomethy l)cyclohexane- 1 -carboxy-(6- 
amidocaproate 


LC-SMCC 


Succinimidyl 6-(3-[2-pyridyldithio]- 
propionamido)hexanoate 


LC-SPDP 


m-Maleimidobenzoyl-N-hydroxysuccinimide 
ester 


MBS 


Succinimidyl 3-[bromoacetamido]propionate 


SBAP 


iV-Succinimidyl iodoacetate 


SIA 


iV-Succinimidyl[4-iodoacetyl]aminobenzoate 


SIAB 


Succinimidyl 4-[iV- 

maleimidomethyl]cyclohexane-l-carboxylate 


SMCC 
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Succinimidyl 4-[/?-maleimidophenyl]butyrate 


SMPB 


Succinimidyl-6-[J3- 


SMPH 


maleimidopropionamido]hexanoate 




4-Succinimidyloxycarbonyl-methyl-a-[2- 


SMPT 


pyridyldithio]toluene 


i 


A^Succinimidyl 3-[2-pyridyldithio]- 


SPDP 


propionamido 




N-e-Mdeimidocaproyloxy]sulfosuccinimide 


Sulfo-EMCS 


ester 




N-[g-MaIeimidobutyiyloxy]sulfosuccinimide 


Sulfo-GMBS 


ester 




iV-[k- 


Sulfo-KMUS 


Maleimidoundecanoyloxy]suIfosuccinimide 




ester 




4-Sulfosuccinitnidyl-6-methyl-a-(2- 


Sulfo-LC-SMPT 


pyridyldithio)toluamido]hexanoate 




Sulfosuccinimidyl 6-(3 L [2-pyridyldithio]- 


Sulfo-LC-SPDP 


propionamido)hexanoate 




m-Maleimidobenzoyl-N- 


Sulfo-MBS 


hydroxysulfosuccinimide ester 




jV-Sulfosuccinimidyl[4- 


Sulfo-SIAB 


iodoacetyljaminobenzoate 




Sulfosuccinimidyl 4-[JV- 


Sulfo-SMCC 


maleimidomethyl]cyclohexane-l-carboxylate 




Sulfosuccinimidyl-4-(P-Maleimidophenyl) 


Sulfo-SMPB 


Butyrate 




Amino Groups 




N-5-Azido-2-nitrobenzoyloxysuccinimide 


ANB-NOS 


Methyl N-succinimidyl adipate 


MSA 


N-Hydroxysuccinimidyl-4-azidosalicylicacid 


NHS-ASA 


7\T-Succinimidyl(4-azidophenyl)-l, 3'- 


SADP 


dithiopropionate 




Sulfosuccinimidyl 2-[7-amino-4- 


SAED 


methylcoumarin-3-acetamido]ethyl- 
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1,3 'dithiopropionate 




Sulfosuccinimidyl 2[m-azido-o- 
nitrobenzamido]-ethy 1- 1 ,3 '-dithiopropionate 


SAND 


iV-Succinimidyl-6-[4'-azido-2'- 
nitrophenylamino] hexanoate 


SANPAH 


Sulfosuccinimidyl-2-[p- 
azidosalicylamido]etiiyl-l,3'-dithiopropionate 


SASD 


Sulfosuccinimidyl- 
[perfluoroazidobenzamido]ethyl- 1 ,3 
dithiopropionate 


SFAD 


i^-Hydroxysulfosuccinimidyl-4-azidobenzoate 


Sulfo-HSAB 


Sulfosuccinimidyl[4-azidosalicylamido]- 
hexanoate 


Sulfo-NHS-LC-ASA 


JV-Sulfosuccinimidyl(4-azidophenyl)-l , 3 '- 
dithiopropionate 


Sulfo-SADP - 


iV-Sulfosuccinimidyl-6-[4'-azido-2'- 
nitrophenylamino] hexanoate 


Sulfo-SANPAH 


p-Azidophenyl glyoxal monohydrate 


APG 


N-B-Maleimidopropionic acid 


BMPA 


Carbohydrate Reactive-Photoreactive 




N-Succinimidyl-S-acetylthiopropionate 


SATP 


Sulfliydryl-Carbohydrate Reactive 




4-(4-N-Maleimidophenyl)butyric acid 
hydrazide hydrochloride 


MPBH 


3-(2-Pyridyldithio)propionyl hydrazide 


PDPH 


Sulfhydryl-carbonyl (aldehyde)/carboxyl 
reactive 




JV-[B-Maleimidopropionic acid]hydrazide«TFA 


BMPH 


N-e-Maleimidocaproic acid]hydrazide 


EMCH 


N-[k-Maleimidoundecanoic acidjhydrazide 


KMUH 


N-[p-Maleiraidophenyl]isocyanate 


PMPI 




TFCS 
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EXAMPLE 16 

Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 
template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
5 Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. Any DNA microarray may be used including but not limited to commercially 
available and non-commercially available arrays. 

A DNA microarray can be designed to contain SNPs located on the chromosome or 

10 chromosomes of interest including but not limited to a DNA microarray containing SNPs located on 
chromosomes 13, 18, and 21, a DNA microarray containing SNPS located on chromosomes 13 and 
18, a DNA microarray containing SNPS located on chromosomes 13 and 21, a DNA microarray 
containing SNPS located on chromosomes 18 and 21, a DNA microarray containing SNPS located 
on chromosomes 13, 18, 21, 15, 22, X, Y, a DNA microarray containing SNPS located on each of 

15 the autosomal chromosomes and each of the sex chromosomes, a DNA microairay containing 
SNPS located on chromosome 13, a DNA microarray containing SNPS located on chromosome 18, 
a DNA microarray containing SNPS located on chromosome 21, a DNA microarray containing 
SNPS located on chromosome 15, a DNA microarray containing SNPS located on chromosome 17, 
a DNA microarray containing SNPS located on chromosome 22, a DNA microarray containing 

20 SNPS located on a single chromosome, and a DNA microarray containing SNPS located on 
multiple chromosomes. 

In this example, SNPs are analyzed by GeneChip HuSNP Arrays from Affymetrix, 
however any number of DNA arrays, including but not limited to GeneChip arrays, GenFlex Tag 
arrays, Mapping 10K Array, other Affymetrix arrays, and other DNA arrays can be used. 

25 

Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 
women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
(catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 
30 formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes or cross-link cell 
membranes can be added to the tubes including but not limited to formaldehyde, and derivatives of 
formaldehyde, formalin, glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary 
35 amine reactive crosslinkers, sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide 
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reduction, carbohydrate reactive crosslinkers, carboxyl reactive crosslinkers, photoreactive 
crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, 
BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, 
DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds 
5 listed in Table XXIII. Any concentration of agent that stabilizes cell membranes, impedes cell lysis 
or cross-link cell membranes can be added. In a preferred embodiment, the agent that stabilizes cell 
membranes, impedes cell lysis, or cross-links cell membranes is added at a concentration that does 
not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 

10 reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 

15 citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zohegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

20 

Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
25 3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

Isolation of DNA 

30 DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 

from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 51 183). DNA is eluted in 100 jil of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat." 
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Identification of Homozygous Maternal SNPs 
HuSNP Assay 

5 The HuSNP assay is done as described by K. Lindblad-Toh et al. (Nature Biotechnology, 

Vol. 18, 1001-1005). The GeneChip® HuSNP™ Array is thought to enable whole genome surveys 
by simultaneously tracking nearly 1,500 SNPs dispersed throughout the genome. In this example, 
HuSNP array is used as a representative Affymetrix array, and is not meant to limit the use of other 
arrays including but not limited to GeneChip CYP450; and Affymetrix custom arrays that are 
10 designed to meet specific user requirements. 

PCR Amplification 

Maternal DNA is assayed according to the HuSNP protocol supplied by Affymetrix Inc. 

15 For each sample, 24 pools of primer pairs (50-100 loci/pool at 50 nM each) are mixed with 5 ng of 
maternal DNA., 5 mM MgCl 2 , 0.5 mM dNTPs, 1.25 U Amplitaq Gold (PE Biosystems, Foster City, 
CA), and the supplied buffer in 12.5 pi per pool. Samples are denatured for 5 min at 95°C followed 
by 30 cycles of 95°C for 30 s, 52°C + 0.2°C/cycle for 55s, and 72°C for 30s; 5 cycles of 95°C for 
30 s, 58°C for 55s, and 72°C for 30s and a final extension of 72°C for 7 min. A 1:1000 dilution of 

20 each pool is made by adding 1 pi of the amplification product to 999 ^1 of ddH20. After, 2.5jal of 
the 1: 1000 dilution is transferred to a new plate and amplified with 0.8pM biotinylated T7 and 0.8 
pM biotinylated T3 primers, 4mM MgC12, 0.4 mM dNTPs, 2.5 U Taq and the supplied buffer in 25 
pi for 8 min at 95°C, followed by 40 cycles of 95°C for 30 s, 55°C for 90s, and 72°C for 30 s, and a 
final extension of 72°C for 7 min. Then 1.5pl from each pool is tested for amplification on a 3% 

25 agarose gel. For each sample, the remainder of each the 24 pools is mixed and loaded on a 

Microcon-10 spin column (Amicon Bioseparations, Bedford, MA). Samples are concentrated by 
spinning the column for 20 min at 13,000 g at room temperature and are eluted by inverting the 
column and centrifiiging for 3 min at 3,000 g. Volumes 'are adjusted to 60 pi. 

A custom array can be designed using only the SNPs that are of interest. For example, a 

30 custom array may be designed that contains SNPs that are located on chromosomes 1, 13, 21, 18, 
15,X,andY. 

Additionally, any number of SNPs can be amplified including SNPs located on any human 
chromosome including chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, X or Y. Two representative SNPs on chromosome 13 and two representative SNPs on 
35 chromosome 21 are chosen. The genomic location and sequence of SNPs may be found at the SNP 
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consortium (http://snp.cshl.orgy If these SNPs are not present on the array, different SNPs can be 
chosen. 

SNP TSC0466917 (C/G), which is located on chromosome 13, is amplified using the 
following primers: 

5 

Upstream Primer: 

5 9 CCAGCTGGTAGAACTT 3' 

1 0 Downstream Primer: 

5 5 CCCAATAGACCTATAG V 

SNP TSC1 172576 (T/A), which is located on chromosome 13, is amplified using the 
following primers: 

15 

Upstream Primer: 

5' TAGCAGAATCTCTCAT 3' 

Downstream Primer: 
20 5' AGAGTATCTCATTTGTT 3' 

SNP TSC027162S (A/G), which is located on chromosome 21, is amplified using the 
following primers: 

25 Upstream Primer: 

5' AGGAAATTGTGAAGTA V 

Downstream Primer: 

5' TAACTCACTCACTATC 3' 

30 

SNP TSC0069805 (C/T), which is located on chromosome 21, is amplified using the 
following primers: 

Upstream Primer: 
35 5' CTGCTGAGTCATAGTC 3* 
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Downstream Primer: 

5' TGTTCTTTGAATCAAC V 

5 Hybridization to GeneChip Probe Arrays, Washing and Staining 

5-30 \i\ of the sample (depending on the intensity of the chip lot) is diluted in 3 M 
tetramethylammonium chloride (TMAC1), 2mM control oligonucleotide Bl (supplied by 
Afiymetrix), 5X Denhardt's solution, 100 ng/ml herring sperm DNA, 5 mM EDTA pH 8.0, 10 mM 
10 Tris pH7.8, and 0.01% Tween 20 in a volume of 135 yl and is denatured for 10 min at 95°C. After 
two minutes on ice, the samples are loaded into HuSNP chips and hybridized for 16 h at 44°C and 
40 r.p.m. 

Each chip is washed and stained on the Afiymetrix fluidics. Chips are washed for two 
cycles of two mixes with 6X SSPET (Bio Whitaker, Walkersville, MD) (6X SSPE (sodium 

1 5 chloride, sodium phosphate, sodium EDTA) + 0.01% Triton-X-1 00) at 25°C, and for six cycles of 
five mixes with 4X SSPET (4X SSPE + 0.01% Triton X-100) at 35°C. Chips are stained for 30 min 
at 25°C with 50 jig/ml streptavidin-phycoerhthrin and 0.25 mg/ml biotinylated anti-streptavidin 
antibody in 6X SSPE, IX Denhardt's solution, and 0.01% Tween 20 in a volume of 500 pi. The 
chip is filled with 6X SSPET following six washes of four mixes with 6X SSPET at 25°C. 

20 After the hybridization, washing, and staining procedures, the HuSNP probe arrays are 

scanned using the HP GeneArray Scanner (HuSNP Mapping Assay Manual Afiymetrix P/N 
700308). 

Scanning 

25 

The HuSNP probe arrays are scanned using the HP GeneArray Scanner according to the 
HuSNP Mapping Assay Manual (Afiymetrix P/N 700308). Other scanners may be used including 
but not limited to the AlphaArray™ Reader. Genotype calls are made automatically from the 
collected hybridization signal intensities by the Asymetrix Microarray Suite version 5.0 software. 

30 Each allele of a SNP is represented by four or five complementary probes with different locations 
of the SNP base position within the 20-nucleotide probes. Each of these probes, in turn, is paired 
with a probe of the same sequence except for a central mismatch at or near a SNP position, intended 
to correct the fluorescence value for non-specific binding to the probe. 

Each SNP is genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal 

3 5 DNA is homozygous, are analyzed with the DNA isolated from the plasma. 
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Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically is 
present in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number df genomes, which can 
influence the ratio of one allele to another (see Example 1 1). 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
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of template DNA and primer per reaction is optimized for each locus of interest In this example, 
the 20 fil of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
5 PCR reaction volume of 50 fil (20jil of template plasma DNA, 1 \il of sterile water, 4 |il of primer 
mix, and 25 pi of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
conditions are used: 

(1) 95°C for 15 minutes; 

(2) 95°C for 30 second; 
10 (3) 4°C for 30 seconds; 

(4) 37°C for 30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
1 5 trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
aL 9 PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
aL, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
20 bacteriophage 29, which undergoes rolling circle replication (Dean et <al., Genomic Research 
1 1:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified contains annealing sequences for the 
primers that are used with the HuSNP assay. Upon purchase of the HuSNP array, each SNP and the 
25 primers used to amplify each SNP can be identified. With this knowledge, the multiplex primers 
are designed to encompass annealing regions for the primers in the HuSNP Array. 

Purification of Fragment of Interest 

30 The unused primers, and nucleotides are removed from the reaction by using Qiagen 

MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 |nl 
of sterile water. 5 |xl of each amplified loci is mixed together 

35 HuSNP Assay, Washing, Staining and Scanning 
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The pooled DNA is assayed with the HuSNP Array as described above. Washing, staining, 
and scanning procedures are as described above. 

Each SNP is genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal 
DNA is homozygous, and DNA isolated from the plasma is heterozygous are quantitated. 

Quantification 

The intensity of the signal for each allele at a hetero2ygous is SNP is quantitated. As 
discussed above, the expected ratio of allele 1 to allele 2 can be used to determine the presence or 
absence of a chromosomal abnormality. If the maternal genome is homozygous at SNP X (A/A), 
and the plasma DNA is heterozygous at SNP X (A/G), then the G represents the distinct fetal 
signal. The ratio of G:A depends on the percentage of fetal DNA present in the maternal blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 

However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 
for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

This example explained the use of Affymetrix HuSNP Arrays, but it not intended to limit 
the use of arrays. Any DNA array may be used including but not limited to the DNA ajnrays listed 
in Table XXIV, or DNA arrays available from any of the companies listed in Table XXIV, or XXV. 
Custom DNA arrays can be made for detecting fetal chromosomal abnormalities in maternal blood 
using any number of products or services including but not limited to those listed in Tables XXTV, 
XXV, XXVI and XXVfl. 



Table XXTV: Features of some hybridization microarray formats. 



Company 


Product Name 


Array Method 


Hybridization 
Step 


Readout 


AfJVmetrix. 
Inc.. 


GeneChip® 


In situ (on-chip) 
photolithographic 


10,000-260,000 
oligo features 


Fluorescence 
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synthesis of -20- 


probed with 








25-mer oligos 


labeled 30-40 








onto silicon 


nucleotide 








wafers, which are 


fragments of 








cut into 1.25 


sample cDNA 








cm 2 or 5.25 cm 2 


or antisense 








chips 


RNA 




Agilent 


DNA 


Custom Designed 






Technologies 


Microarray 

Access 

Program 


Arrays 






Amersham 


CodeLink 








Biosciences 


Arrays 








BD 


RiboScreen 








PharMimgen 


Human- 1 
Membrane 








CLONTECH 


Atlas Glass 
Human 


Variable 


Variable 


Variable 




Atlas Glass 






• 




Human 1.2 










Atlas Glass 










Human 1.2 II 










Atlas Glass 










Human 1.2 m 










Atlas Human 










Cancer 










Atlas Human 










Apoptosis 










Atlas Human 










Cancer cDNA 










Atlas Human 










Cardiovascular 










Atlas Human 










cDNA 






i 




Atlas Human 
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cell cycle 
Atlas Human 
Cell Interaction 
Atlas Human 
Cytokine 
Receptor 
Atlas Human 
Hematology 
array 

Atlas Human 
Neurobiology 
Atlas Human 
Oncogene 
Atlas Human 
Stress array 
Atlas Human 
Toxicology 
Atlas Human 
Tumor array 
Atlas Human 
Oncogene 
Custom Atlas 
arrays 

Custom Atlas 

OVlvWl CLLXCLj & 






• 

• 


Brax, 




Short synthetic 
oligo, 

synthesized off- 
chip 


1000 oligoson 
a "universal 
chip" probed 
with tagged 
nucleic acid 


Mass 

spectrometry 
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Display 


discoveryArray 


Uses restriction 






Systems 


Human Gene 


fragment 






Biotech 


Display slide 


differential 










display PCR 






Gene Logic, 


READS 1 * 1 


Flow-thru Chip 


Chips with 




Inc., 




Probe Arrays 


microscopic 










channels with 










probes attached 










to their walls 






Star Profiler 




















Arrays 












SNP screening 






Genometrix 


Universal 


services, 96-well 






Inc., 


Arrays™ 


X 250 proprietary 










array 






Genomic 


GenMAP 


Cancer related 






Solutions 


Cancer Chips 


sequences spotted 










in duplicate 






Hvseq Inc., 


HyChip™ 


500-2000 nt 


64 sample 


Radioisotope 






DNA samples 


cDNA spots 








printed onto 0.6 


probed with 








cm 2 (HyGnostics) 


8,000 7-mer 








or -18 cm 2 (Gene 


oligos 








Discovery) 


(HyGnostics) 








membranes 












<-55,000 










sample cDNA 










spots probed 










with 300 7-mer 










oligo (Gene 










Discovery) 








Fabricated 5-mer 


Universal 1024 


Fluorescence 






oligos printed as 


oligo spots 








1,15 cm 2 arrays 


probed 10 kb 
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onto glass 
(HyChip) 


sample cDNAs, 
labeled 5-mer 
oligo, and 
ligase 




Incvte 

Pharmaceutica 


GEM 


Piezoelectric 
printing for 
spotting PCR 
fragments and 
on-chip synthesis 
of oligos 


<=1000 
(eventually 
10,000) 
oligo/PCR 
fragment spots 
probed with 
labeled RNA 


Fluorescence and 
radioisotope 


Is. Inc. 


INTERACTI 
VIA 




Gold affinity 
array 






Mergen 




ExpressChip 
DNA microarray 
system 






Molecular 
Dynamics, Inc 


Storm® 
Fluorlmager® 


500-5000 nt 
cDNAs printed 
by pen onto -10 
cm 2 on glass slide 


-10,000 cDNA 
spots probed 
with 200-400 
nt labeled 
sample cDNAs 


Fluorescence 


Motorola 


CodeLink 
Array 








MWG- 
Biotech 


Biochip 
service, 
licensed by 
Affymatrix 








NEN Life 

Science 

Products 


MICROMAX 
Human cDNA 








MICROMAX 
Human Kinases 
and 

phosphatases 


MICROMAX 
human 
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transcription 
factors 






• 


MCROMAX 
human 

oncogene and 
tumor 
suppressor 
genes 


Operon 
Technologies 


OpArray 

Human 

Collagen 








OpArray 

Human 

apoptosis 


OpArray 
Human stress 
and aging 


Origene 
Technologies 


SmartArray 
Human 
Smartset 1 








SmartArray 
Human 
Smartset 2 


SmartArray 
Human 
Smartset 3 


SmartArray 
Human 
Smartset 4 


Nanogen 


Semiconductor 
Microchip 


Prefabricated 
~20~mer oligos, 
captured onto 
electroactive 
spots on silicon 
wafers, which are 


25, 64,400 
(and eventually 
10,000) oligo 
spots polarized 
to enhance 
hybridization to 


Fluorescence 
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diced into <=1 
cm 2 chips 


200-400 nt 
labeled sample 
cDNAs 




Phase- 1 

Molecular 

Toxicology 


Human 350 
Gene Array 








ProtoGene 
Laboratories 


DNA FlexChip 
technology 


Surface tension 
based arraying; 

On-chip synthesis 
of40-50-mer 
oligos onto 9 cm 2 
glass chip via 
printing to a 
surface-tension 
array 


<=8,000 oligo 
spots probed 
with 200-400 
nt labeled 
sample nucleic 
acids 


Fluorescence 


R&D Systems 


Human 
Cytokine 
Expression 
Array 






• 


Human 
Apoptosis 
Expression 
Array 


Research 
Genetics 


Human 
Genefilters 
Micorarray 
Releases I-VII 








Human 

"Named 

Genes" 

Genefilters 

Micorarray 

Releases I 


Human 
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Prostate 
specific 
Genefilters 
Micorarray 
Releases I 








Human Ovary 
Specific 
Genefilters 
Micorarray 


Human Breast 
specific 
Genefilters 
Micorarray 


Human Colon 
specific 
Genefilters 
Micorarray 
Releases I 


Dermarray 
Human Skin 
Genefilters 
Micorarray 


Radius 
Biosciences 


Custom 

Arraying 

Services 








Rosetta 
Impharmatics 


FlexJet DNA 
Microarrays; 
Resolver™ 








Sequenom 


MassArray 
SpectroChip 


Off-set printing 
of array; around 
20-25-mer oligos 


250 locations 
per 

SpectroChip 
interrogated by 
laser desorbtion 
and mass 
spectrometry 


Mass 

spectrometry 
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SEQWRIGHT 


Arraying 
Services 








Sigma- 
Genosys 


Panorama 
Human 

Cytokine Gene 
Arrays 








Panorama 
Human 
Apoptosis 
Gene Arrays 


Stratagene 


GeneConnectio 
n Discovery 
Microarray 








SuperArray 


Human 

Apoptosis-1, 2, 
3, GEArray Kit 






i ! 


Human 
Apoptosis-4, 
Bcl-2 Family 
and Regulators 
GEArray Kit 


Human 
Apoptosis- 
5/TNF andRas 
Network 
GEArray Kit 


Human p53 
GEArray Kit 


Human 

Toxicity/Stress 
GEArray Kit 


Human Cell 
Cycle 1, 2 
GEArray Kit 


Human Cancer 
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Metastasis 

GEArrayKit 

Human 

Cancer/Oncoge 
ne GEArray 
Kit 

Human 

Cancer/Tumor 

Suppressor 

GEArrayKit 

Human 

Cancer/Angiog 
enesis GE 
Array Kit 
Human 

Cancer/Radiati 
on Sensitivity 
Marker 
GEArray Kit 
Human 

PathwayFinder 
GEArray Kit 
Human Jak- 
Stat Pathway 
GEArrayKit 
Human P13 
Kinase and 
AKT Pathway 
GEArrayKit 
Human NFkB 
Pathway 
GEArrayKit 
Human 
Common 
Cytokine 
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GEArrayKit 

Human 

Interleukin 

Receptor 

GEArrayKit 

Human 

Inflammatory 

Response 

GEArrayKit 

Human Choice 

GEArrays 








Svnteni. Inc., 
(acquired by 
Incvte 

Pharmaceutica 
Is. Inc.} 


UniGEM™ 


500-5,000 nt 
cDNAs printed 
by tip onto ~4 
cm 2 glass chip 


<=10,000 
cDNA spots 
probed with 
200-400 nt 
labeled sample 
cDNAs 


Fluorescence 


TeleChem 

International, 

Inc. 


Flex-Chips 
Custom Arrays 
eChips Custom 
Arrays 

Discovery Chip 
Arrays 








Vysis Inc. 


AmpliOnc 
Microarray kit 








The German 

Cancer 

Institute 




Prototypic PNA 
macrochip with 
on-chip synthesis 
of probes using f- 
moc or t-moc 
chemistry 


Around 1,000 
spots on a 8 x 
12 cm chip 


Fluorescence/mas 
s spectrometry 



Table XXV: Companies that produce arrays, or devices and instrumentations involved in the 
production of arrays. 
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COMPANY 


PRODUCT/RESEARCH 


Website Address 


ACLARA 
Biosciences, Inc. 


Plastic chips and 
microfluidic systems based 
on "Lab-On-A-Chip" 
microfluidics US Patent 
5,750,015: "Method and 
device for moving 
molecules by the 
application of a plurality of 
electrical fields"). 




Advanced Array 
Technology S.A. 


BIO-CD™: compact disc 
platform for DNA detection 


http://www.aat-arrav.com 




Affymetrix, Inc., 


GeneChip^arrays, 
including HIV, p450, p53, 
Rat Toxicology U34 arrays, 


http://www.affymetrix.com 




Agilent 

Technologies, Inc 


A subsidiary of Hewlett- 
Packard Company, plans to 
expand its presence in the 
life science market through 
the introduction of a new 
DNA microarray program. 
It uses inkjet printing 
technology to manufacture 
its oligo-based DNA 
microarrays. Licensed from 
Ed Southern/OGT. 
LabChip™-based DNA and 
RNA bioanalyzer 


http://www.agilent.com 




Alpha Innotech 
Corp. 


Alpha Innotech provides 
innovation bioinformatic 
imaging solutions for 
genetic discovery designed 
to acquire, manage, and 
analyze fluorescence, 
chemiluminescence, or 


http://www.alpha-tec.net 
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colorimetric microarray 
slides, plates, gels, blots, or 
films. 




Amersham 
Biosciences 


Lucidea array spotter, 
automated slide processor, 
scanner, spotfinder, scoring 
system 


http://www.apbiotech.com 


AlphaGene, Inc. 


Full length cDNA FLEX™ 
and MicroFLEX library 
construction; High 
Throughput Gene 
Expression Profiling; High 
Throughput DNA 
Sequencing; Bioinformatics 




Applied Precision, 
Inc. 


ArrayWoRx is a wide field 
light source based 
microarray scanner, 
combines limitless 
wavelength possibilities 
with automation and image 
processing software. 




Asper Ltd. 


Arrayed Primer Extension 
(APEX) and Asper 
ChipReader 003 




AVIVA Biosciences 
Corp. 


Dedicated to the 
application of breakthrough 
multiple-force biochip 
technology for genomics 
and proteomics. The 
company is developing an 
integrated sample-to-result 
AVIChip™ system with an 
emphasis on biological 
sample preparation and 
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chip-based molecular 






manipulation. The 






AVIChip™ system will 






separate and transport a 






variety of mRNA, or other 






molecules from crude 






biological samples and 






simultaneously perform a 






wide range of biological 






and biochemical analyses. 






AVIVA's technology 






allows fast, accurate, 






automated, and high- 






throughput biological 






analysis on integrated 






biochip systems and 






provides novel approaches 






to both drug development 






and clinical diagnostics 




Axon Instruments, 


Integrated Microarray 




Inc. 


Scanner and Analysis 






Software, simultaneously 






scans microarray slides at 






two wavelengths using a 






dual laser scanning system, 






displays images from two 






wavelengths and a ratio 






image as they are acquired 






in real time; US$50,000) 




BioArray Solutions, 


Light-controlled 




LLC 


Electrokinetic Assembly of 






Particles near Surfaces 






(LEAPS), enables 






computer controlled 
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assembly of beads and cells 
into planar arrays within a 
miniaturized, enclosed fluid 
compartment on the surface 
of a semiconductor wafer. 




BioCat 


Distributor of catalogue 
and custom arrays 


http://biocat.de 


bioDevice Partners, 


Provides consulting 
services to the 
microarraying community 
in the area of optics and 
instrumentation 


i 


BioDiscovery, Inc. 


ImaGene™, special image 
processing and data 
extraction software; 
CloneTracker: Databases 
clones, plates, and slides, 
and offers array design tool 
and interfaces to arrayers; 
GeneSight: Powerful 
expression analysis 
software which features 
statistical methods as well a 
visualization tools. 


! 


Biomedical 
Photometries, Inc., 


MACROscope™ for 
reading genetic 
microarrays, in 
collaboration with 
Canadian Genetic 
Microarray Consortium 




BioRobotics Ltd. 


MicroGrid, for arraying 
oligonucleotides or cDNA 
clones on glass slides and 
plastic chips) 




Caliper 


LabChips™ based on 
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Technologies Corp 


microfluidics 




Capital Biochip Corp 


Created to develop and 
commercialize various 
biochip technologies. 




Cartesian 
Technologies, Inc. 


PixSys PA Series: for 
Automated liquid handling 
system for creating high- 
density arrays for genomics 
research. Scan Array 3000: 
A Fluorescent Imaging 
System for microarray 


i 


Cellomics, Inc. 


ArrayScan™, cell-based 
"High Content Screening" 
(HCS) for drug discovery 




v^epneia 


Microfluidics 




Clinical Micro 
Sensors, Inc. 


DNA microchip-based 
medical diagnostics; 
detection of directly detect 
DNA via electron transfer. 




Clondiag Chip 
Technologies 


Working on generation and 
application of DNA 
microarrays 




Clontech's Atlas™ 
human cDNA array 


Sells various human arrays 


http://www.clontech.com 




CombiMatrix 
Corporation 


CombiMatrix' core 
technology is the Lab-on-a- 
Chip integrated circuit. 
These integrated circuits 
contain arrays of 
microelectrodes that are 
individually addressable 
using logic circuitry on the 
chip. 
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Compugen 


LEADS 1 M drug discovery 
platform for identifying 
drug targets based on the 
analysis of EST (Expressed 
Sequence Tag) and 
genomic databases, 
expression results from 
chips and proteomics, and 
polymorphism detection 
and qualification; DNA 
chip design and analysis. 




Corning Science 
Products Division 


Provides the Corning 
Microarray Technology 
CMT-GAPS amino silane 
coated slides and CMT- 
Hybridization chamber. 




Cruachem Ltd, U.K 


Manufactures the 
phosphoramidite building 
blocks for the synthesis of 
DNA. 




CuraGen Corp. 


GeneCalling™ and 
Quantitative Expression 
Analysis (QEA™), 
CuraMode, CuraTox 




diaDexus, LLC 


Specialized in using 
microarray technology for 
molecular diagnostics 


■ 


Display Systems 
Biotech, Inc, 


discoveryARRAY slides 
(over 2400 expressed 
cDNA fragments); will 
soon offer over 40,000 
arrayed mouse and human 
genes 




DNAmicroarray.com 


Offers complete "made to 
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• 


order" high density DNA 
microarray synthesis and 
analysis services. 




Erie Scientific 
Company 


Manufactures microslides 
for microarrays. 




Exiqon 


Microarray slides and chips 


http://www.exiqon.com 


Expression Analysis 
Inc 


Company was formed to 
provide GeneChip 
processing and gene 
expression analysis using 
Affymetrix GeneChip 
microarrays 




Febit 


System for gene expression 
profiling or genotyping 


http://febit.com 


Gene Logic, Inc. 


Flow-thru Chip™: has 
hundreds of thousands of 
discrete microscopic 
channels that pass 
completely through it. 
Probe molecules are 
attached to the inner 
surface of these channels, 
and target molecules flow 
through the channels, 
coming into close 
proximity to the probes. 
This proximity facilitates 
hybridization. READS™, 
Restriction Enzyme 
Analysis of Differentially- 
expressed Sequences, for 
capturing and analyzing the 
overall gene expression 
profile of a given cell or 
tissue type to identify drug 
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targets 




Geneka 

Biotechnology Inc. 


Oligonucleotide-based 
microarray slide, the 
P.R.O.M. (Proteomic 
Regulatory Oligonucleotide 
Microarray). 35-45-mers 




Genemachines 
Genomic 
Instrumentation 
Services, Inc. 


OmniGrid, glass slides or 
nylon membranes 




GeneScan 


Biochip technology; high 
throughput production of 
biochips 


http://www.eenescan-europe.com 




General Scanning 
Inc 


Laser scanning and 
micropositioning, 
manufactures MicroArray 
Biochip Scanning System: 
ScanAnray™). Now called 
GSI Lumonics 


i 


Genisphere 


Provides fluorescently- 
labeled kits for gene 
expression arrays. Uses 
highly branched nucleic 
acids - dendrimer 
technology. 


http://www.eenisphere.com 


Genetic 

MicroSystems Inc. 


Instrumentation for DNA 
microarray-based analysis. 
Acquired by Affymetrix 




Genicon Sciences 
Corp 


Developed an ultra- 
sensitive signal generation 
and detection platform 
technology based on 
Resonance Light Scattering 
(RLS) for the simple and 
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efficient detection, 
measurement and analysis 
of biological interactions 




Genometrix Inc. 


Bioscanner™, 

Gene View®, Universal 

Arrays™, Risk-Tox 




Genomic Solutions, 


Flexys™ modular robotic 
system, GeneTAC™ and 
Genomic Integrator™ array 
analysis products automates 
the imaging and analysis of 
gene microarrays 


http://www.penomicsolutions.com 


GENPAKInc 


genpakARRAY21 robotic 
microarrayer system and 
genSTATION 3XL manual 
microarrayer system. 


! 


GeSiM 


The Nano-Plotter is based 
on piezoelectric pipetting 
principle 




Genzyme Molecular 
Oncology 


SAGE®: Serial Analysis of 
Gene Expression 




Greiner Bio-one 


Ready to use biochip kits 
for genotyping and SNP 
detection, modified slides 
for microarraying 


http://www.greinerbioone.com 


HP GeneArray 
Scanner 


Used by Affymetrix and 
others 




Hypromatrix, Inc. 

• 


Hypromatrix 
AntibodyArray TM is 
designed to detect protein- j 
protein interactions, post- 
translational modification 
and protein expression. 




Hyseq Inc. 


Sequencing By 
Hybridization. HyX 
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platform and Gene 
Discovery, HyGnostics, 
and HyChip™ modules 




Alumina, Inc. 


Utilizes fiber optics, 
microfabrication, and 
advanced information 
processing to create arrays 
where 250,000 discrete 
sensors fit on a probe the 
diameter of the head of a 
pin. Technology: 
"BeadArray." 


http://www.illumina.com 




Incyte Genomics, 
Inc. 


GEM Microarrays, 
GeneJet™ array, LifeSeq® 
Database with estimated 
100,000 genes, and 
LifeArray Microarray 
Software. 




IntegriDerm, Inc 


Produces DermArray DNA 
microarrays for 
dermatologic research 




JMAR's Precision 
Systems, Inc 


Designer and manufacturer 
of UV exposure and mask 
aligner systems specifically 
designed for bio-chip 
manufacturers. Also 
produces custom 
micropositioning systems 
for micro-spotting 
equipment and high 
resolution dimensional 
metrology and defect 
inspection systems for 
quality assurance of bio- 
chips and DNA 


i 
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microarrays 




Lab-on-a-Chip.com 


Provides focused 
information on all Lab-on- 
a-Chip technologies. It 
includes published papers, 
news, events, new products, 
suppliers, research links, 
jobs and discussion forums. 




Labman Automation 
Ltd., 


HDMS: Labman High- 
Density Microarray Spotter 




Lambda 


Ready to use biochip kits 
for genotyping and SNP 
detection 


htto://www.lambda.at 


Lifecodes Corp. 


Lifecodes MicroArray 
System: LMAS 


i 


Lynx 


Megasort™ is a bead-based 
process providing 
differential DNA analysis 




Medway 


MEDWAY designs, 
develops, manufactures and 
commercialises medical 
devices for diagnostics, 
robotic systems, optical 
instruments, fluorescent 
molecular markers, sieving 
microchips 




Mergen Ltd 


ExpressChip™ 
oligonucleotide microarray 




Memorec Stoffel 


PIQOR technology for 
producing custom and 
generic murine and human 
cDN A arrays 


http://www.memorec.com 


MetriGenixInc 


The 4D Array utilizes a 
patented flow through 
design that optimizes the 
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surface area to volume 
ratio, has shorter 
hybridization times, 
provides larger 
binding/signal capacity, 
and is more readily 
automated than flat 
biochips 




Micralyne Inc. 


Fabricates micromachined 
glass, silicon and thin film 
components for use in 
microfluidics. 




MicroFab 
Technologies, Inc. 


Manufactures piezoelectric 
drop-on-demand ink-jet 
printing technology for 
microdispensing fluids. 




Micronics, Inc 


Microfluidics based 
systems for application to 
clinical laboratory 
diagnostics: 
Microcytometer™, H- 
Filter™, T-Sensor™, and 
O.R.C.A. nFluidics 




Molecular 
Dynamics, Inc 


Storm® and Fluorlmager® 




Molecular Tool, Inc 


Genetic Bit Analysis, 
GBA®' Genomatic™. 
Acquired by Orchid 
Biocomputer on September 
14, 1998. 




Mosaic 

Technologies, Inc., 


fcZ-KAYS activated slide 
kits for DNA microarrays 




Motorola BioChip 
Systems 


Licensed a 3-D gel pad 
technology from Argonne 
National Laboratory; 


htto://www.motorola.com/lifescien 


ces 
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CodeLink Array System. 




MWG Biotech 


Oligo-based microarrays, 
custom chips, automated 
systems 


http://mwsatccn.mwpdna.com 


Nanolytics 


Developing Custom Array 
Synthesis Technology 




Nanogen 


Electronic Addressing, 
Concentration, and 
Hybridization; NanoChip 
automated workstations for 
SNP and STR analysis 


http://www.nanogen.com 


NEN Life Science 
Products 


MICROMAX™ Human 
cDNA Microarray System I 
for differential gene 
expression analysis 




Operon 

Technologies, Inc. 


Low density (320 or 370 
genes, 70-mers) 
OpArrays™ microarrays. 




Orchid Biosciences, 
Inc. 


SNP-IT technology; 
Microfluidic chips; 
applying microfabrication 
processes in glass, silicon, 
and other materials to 
create three dimensional 
structures. Contained 
within these devices are 
small capillary channels 
less than a millimeter wide. 


http://www.orchid.com 

• 


OriGene 

Technologies Inc. 


Offers SmartArray™ chips 
(Huamn), including nuclear 
hormone receptors, 
homeobox/b-zip/HLH 
transciption factors, tissue- 
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specific/inducible 
transcription factors , and 
phosphotyrosine Kinases 




Oxford Gene 
Technology Ltd 


Oliffo-based microarrav 


h tin* //www offt co uk 

XIMUJ ml I TV TT TT I VtliVV/tUA 

1 


Packard Bioscience 
(Division of 
PerkinElmer) 


Hydrogel-coated slides, 
arrayers, scanner and data 
analysis software 


httn*//www nackardbioscience com 

UllUif f TV TV TT a UUVAU1 UUlVUVlVllvW i 


Packard Instrument 
Company 


BioChip Arrayer 




PamGene B.V. 


Flow-through technology 
for microarray 


http://www.pamgene.com 


PanVera 


Intelligence Microarrays 


http://www.panvera.com 


PE Applied 
Biosy stems 


TaqMan Assay, SYBER 
Green I dye Assay; 
Integrated, Micro-Sample 
Preparation System for 
Genetic Analysis 


http://lifesciences.perkinelmer.com 


PharmaSeq, Inc 


Multiplex DNA Diagnostic 
Assay Based on 
Microtransponder 




Phase-1 Molecular 

TVvjcieoloorv Ttift 


Molecular and high 

thrnn all nut tnvipnlnov 

using gene chips (Licensed 
from Xenometrix) 




PicoRapid 
Technologie 


Picoarrays. PicoSlides and 
microarray spotting service 


http://www.picorapid.de 




Protogene 
Laboratories 


Surface tension array on 
glass substrate; "Printing" 
reagents using drop-on- 
demand technology. 




Qiagen Operon 


Custom arrays and Array- 
Ready oligo sets 


http://www.operon.com 
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R&D Systems 


Cytokine Expression Array 
allows one to determine the 
RNA level for 
approximately 400 
cytokines and related 
factors in one standard 
hybridization experiment, 
(charged nylon membrane) 


i 


Radius Biosciences 


Custom DNA, RNA, PNA, 
and Protein MicroArray 
Chips 




RELABAG 


Developing BioChip arrays 
for diagnostic applications 
(oncology). The GeneStick 
platform with arrays on 
plastic sticks and a new 
chemiluminescence imager. 




Invitrogen 


GeneFilters 
microarrays/VastArray 
tissue arrays/ GeneStorm 
Expression Ready full- 
length clones 


V\ i i t\ • f/wrwrwi TP*QCTf*n r»r\m 
11 Lip. // W W W.lCo^Cil.l/UIIl 


RoboDesign 

International Tnc 


Its RoboArrayer is 
integrated with a vision 
system to allow for real- 
time quantification of spot 
size and spot volume 
during the printing process. 




Rosetta Inpharmatics 


FlexJet™ DNA 
oligonucleotides 
microarrays (in-situ 
synthesized on a glass 
support via ink-jet printing 
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process); Resolver™ 
Expression Data Analysis 
System. 




Scienion 


Generic and customized 
arrays 


http://www.scienion.de 




SciMatrix, Inc 


Offers Array Works™, a 
complete line of custom 
microarray services, for the 
production, processing, and 
analysis of microarrays, 
using PixSys™ arrayers 
from Cartesian 
Technologies. It also 
provides customized 
ArrayEngine™ microarray 
systems. 




Sequenom 


DNA MassArray, 
BiomassPROBE, Biomass 
SIZE, 

BiomassSEQUENCE, 
BiomassSCAN, 
BiomassINDEX, and 
SpectroChip 




Sigma-Genosys Ltd 


Panorama™!?, coli Gene 
Arrays, 4,290 genes per 
array 


http://www.siema-eenosvs.com 




SmartBead 
Technologies 


3D UltraPlex™ microarray 
technology for applications 
in SNP genotyping and 
gene expression 


http://www.smartbead.com 


SuperArray Inc 


Their gene expression array 
(GEArray™ ) systems 
(Human and mouse) are 


http://www.superarrav.com 
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designed for pathway- 
specific gene expression 
profiling 




SurModics, Inc 


Manufactures 3D-Link AM 
activated slides for the 
production of microarrays. 
Uses amine-modified DNA 
to hybridize on the surface 
of the slide 




Synteni, Inc. 


UniGEM™ Gene 
Expression Microarray 




TeleChem 
International 


Offers whole system parts: 
ChipMaker, SmartChips, 
Arraylt, Hybridization 
Cassette, ScanArray 3000, 
ImaGene Quantification 
Software, and Super 
Microarray Substrates 




Third Wave 
Technologies, Inc 


Develops and 
commercializes simple, , 
low-cost nucleic acid 
platform technologies to 
fundamentally alter disease 
discovery, diagnosis and 
treatment Invader® assay 
and CFLP® Technology 




Tissue Array 


Expression study of protein 
and in situ screening of 
mRNA. 




VBC Genomics 


Custom oligonucleotide 
microarrays and Aflymetrix 
GeneChip Services 


http://www.vbc-genomics.com 


V&P Scientific, Inc 


Supplies replicators that 
will make macroarrays on 
membranes, or microarrays 
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on slides. 




Virtek Vision 
International Inc 


ChipReader™ is a high- 
sensitivity laser confocal 
system for rapid imaging of 
the DNA microarrays. 




Vysis, Inc. 


CGH-Comparative 
Genomic Hybridization; 
The GenoSensor 
Microarray System 
includes genomic 
microarrays, reagents, 
instrumentation and 
analysis software. 


http://www.wsis.com 


Xanthon 


Developed a multiplexed, 
microplate-based 
electrochemical detection 
system for high-throughput 
screening of compounds for 
their effects on gene 
expression. Based on 
measurement of the 
oxidation of guanine on an 
electrode 


- 


Xenometrix, Inc. 


Gene Profile Assay and 
bioinformatics for gene 
induction profile analysis. 




XENOPORE Corp 


Manufacturer of coated 
microscope slides, 
including silanated, 
silylated, epoxy, 
streptavidin, nickel chelate, 
and many other surfaces. 


4> 



Table XXVI: Array Databases and On-line Tools 
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Database 


Overview 


GeneX 


A comprehensive list of is gene expression 
database and analvsis tools is available at 
NCGR's GeneX site 


Microarray Gene Expression Database (MGED) 
Group 


Microarray Gene Expression Database 
(MGED) Group, was formed to facilitate the 
adoption of standards for DNA-array 
experiment annotation and data 
representation, as well as the introduction of 
standard experimental controls and data 
normalisation methods. 


Gene Expression Omnibus (GEO 


NCBI ! s Gene Expression Omnibus fGEO) 
public gene expression repository in 
development. 


ArravDB 

('http://genome.nheri.nih.sov/arravdb/schema.htmn 
at the National Human Genome Research Institute 




(NHGRI) 


nAn-ay Center 


Nflti nnal Cancer Institute's arrav schema. 


expressDB. 


George Church Lab's at Harvard Medical 

School: a relational database containing yeast 

i 

RNA expression data. 


MAT 


Microarray Analysis Tool) at Albert Einstein 
College of Medicine: based on Java, JDBC, 
and Sybase SQL. 


GATC consortium's published schema 




GeneX: 


A Collaborative Internet Database and 
Toolset for Gene Expression Data at the 
National Center for Genome Resources 
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GenEx™ 


GenEx™ of Silicon Genetics is a public web 
database that allows scientists to freely 
distribute and visualize gene expression data 
(text and image) from microarrays, 
Affymetrix chips, and related technologies. It 
can also dynamically generate several graphs 
from the data being viewed, such as; scatter 
plots, trees, overlays, ordered lists, line 
graphs, or physical position graphs. It is 

HpqiotipH to ctfuv* fltinntfl'rinn^i and 

interpretations on finished experiments, and 
can access data from SQL databases like 
GATC or even from flat text files. 


Stanford MicroArray Database (Oracle) 




Arabidopsis cDNA Microarrav Results 


The Arabidopsis Functional Genomics 
Consortium CAFGCYs Arabidopsis cDNA 
Microarrav Results 


ArravExpress 


ArravExpress, being developed at the 
European Bioinformatics Institute, will be a 
public array-based gene expression data 
repository. An international meeting on 
Microarrav Gene Expression Databases, 


November 14-15, 1999. 


MicroArrav Explorer (MAExploreri. 


Dr. Peter Lemkin at the NCI developed a Java 
applet MicroArrav Explorer (MAExplorert, 
which is currently being used in the 
Mammarv Genome Anatomv Program 


CLUSFAVOR 


Dr. Leif Peterson's CLUSFAVOR: 
Partitioning Large-sample Microarray-based i 
Gene Expression Profiles Using Principal 
Components Analysis 
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SAGEmap 


SAGEmap: A Public Gene Expression 
Resource, Alex E. Lash et al., Genome Res. 
2000 July 1; 10(7): p. 1051-1060 


J-Express 


Java program for analyzing microarray data. 
SOM and PCA implemented, by Bjarte 
Dysvik. 


MicroArrav Informatics at the EBI 







Table XXVD: Microarray Databases on the World Wide Web 



Project 


Name 


Abberdeen university 


aeFDM 


AfiVmetrix & Molecular dvnamics 


Consorsium GATC 




Albert Einstein College of Medicine 


MAT 






Another Microarrav Database. ( Jan L 2000 - vl.01) 


AMAD 




Ecole Normale Superieure ( private database) 


LGM database 




Ecole Normale Sunerieure (public database) 


Y MGV 






European Bioinformatiaue Institute 


Arravexpress 








ExpressDB , 






Pevsner Lab's (Kennedv Krieeer Institute) 


Dragon 


Manchester bioinformatics 


maxd 




National center for Biotechnology Information fNCBD 


Geo 




NCI-LECB 


MAExplorer 


National Human Genome Research Institute (NIH) 


ArravDB 






National Cancer Institute Array Center server fNIH-CIT) 


fiArrav Database 






NIEHS-NDH 


| MAPS 


San Diego Supercomputer Center (SDSC) 


2HAPI 






Silicon Genetics 


GenEx 
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Stanford 


SMD 


University of Pennsylvania 


RAD 






Youne lab (White head institute-MIT) 


chipDB 



EXAMPLE 17 

i 

5 Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 

template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed by the CodeLink SNP Bioarray System, 

10 which is manufactured in a collaborative effort by Motorola and Amersham Biosciences, however 
other microarrays may be used. Amersham sells the CodeLink P450 Bioarray, which is designed to 
genotype various regions of the P450, which is located on chromosome 6. 

However, Amersham may produce custom CodeLink arrays, which can be designed to 
analyze regions of any chromosome including chromosome 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 

15 14, 15, 16, 17, 18, 19, 20, 21, 22, X or Y. 

Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 
women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 

20 (catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 

formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis, stabilize cell membranes, or cross-link cell 
membranes can be added to the tubes including but not limited to formaldehyde, and derivatives of 

25 formaldehyde, formalin, glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary 
amine reactive crosslinkers, sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide 
reduction, carbohydrate reactive crosslinkers, carboxyl reactive crosslinkers, photoreactive 
crosslinkers, cleavable crosslinkers, AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, 
BMH, BMOE, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, 

30 DTBP, DTME, DTSSP, EGS, HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds 
listed in Table XXIII. Any concentration of agent that stabilizes cell membranes, impedes cell 



339 



WO 2004/078994 



PCT/US2004/006337 



lysis, or cross-links cell membranes can be added. In a preferred embodiment, the agent that 
stabilizes cell membranes or impedes cell lysis is added at a concentration that does not impede or 
hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 
5 reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
10 citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

15 

Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
20 ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

25 Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 5 1 183). DNA is eluted in 100 jutl of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat." 

30 

Identification of Homozygous Maternal SNPs 
CodeLink SNP Bioarray 
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CodeLink SNP is a bioarray system solution for broad based SNP genotyping. CodeLink 
SNP involves a multiplexed assay involving PCR amplification and amplicon fragmentation, 
followed by surface based enzymatic allele discrimination and array based labeling and detection. 
The reagents for the CodeLink array are supplied in the P450 Reagent Kit, which provides enough 
5 reagents for 24 reactions. 

The CodeLink™ platform consists of a glass slide that has been silanized to generate 
coverage with long-chain alkyl groups (Ramakrishnan et al., Nucleic Acids Research, Vol. 30, No. 
7, e30, April 1, 2002). A prepolymer, which contains an activated ester, of acrylamide is photo- 
coupled to the slide. The activated ester provides the attachment site for C6-amino- 

10 oligonucleotides. 5'Amine-terminated oligonucleotides are deposited onto the polymer using 
dispensing robotics. Oligonucleotides are dispensed with a fluorescein-derivative dye, which 
enables scanning of every slide after dispensing. To enable attachment of the oligonucleotides, the 
slides are placed into a humidified chamber. Additional sites are blocked and slides are washed, 
rinsed and dried prior to attachment of an integrated, polypropylene hybridization chamber. 

15 A custom CodeLink Array could be synthesized containing SNPs located on the 

chromosome of interest; SNPs located on any chromosome may be analyzed. In this example, a 
representative CodeLink Array with two SNPs located on chromosome 21 (TSC0271628, 
TSC0069805) and two SNPs located on chromosome 13 (TSC0466917, TSC1 172576) is used. 
However, a CodeLink Array with any number of probes for SNPs, including but not limited to 1- 

20 10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100, 101-110, 111-120, 121-130, 
131-140, 141-150, 151-160, 161-170, 171-180, 181-190, 191-200, 201-300, 301-400, 401-500, 
501-600, 601-700, 701-800, 801-900, 901-1000, 1001-2000, 2001-3000, 3001-4000, 4001-5000, 
5001-6000, 6001-7000, 7001-8000, 8001-9000, 9001-10,000 and greater than 10,000 probes can be 
used. The SNPs that are used can be located on a single chromosome, more than one chromosome, 

25 or any combination of chromosomes. 

The CodeLink Array consists of a single stranded probe attached to a surface (a detailed 
diagram is available at 

http://ww5.amershambiosciences.c^^ snp) The 

nucleotide farthest from the surface corresponds to the locus of interest. 
30 For example, SNP TSC09466917, which is located on chromosome 13, can either be a 

cytosine nucleotide or a guanine nucleotide. Two types of probes are made: TAAAAG and 
TAAAC. The probes are attached with the guanine or cytosine farthest from the surface. The 
probes can be longer or shorter in sequence. 
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The nucleotides at SNP TS1 172576, which is located on chromosome 13, can either be 
thymidine or adenine. Two probes are made: TACCGCATAT and TACCGCATAA. The probes 
for the SNPs on the CodeLink Array can be of the same length or different lengths. 

The nucleotides at SNP TSC0271628, which is located on chromosome 21, can either be 
5 adenine or guanine. Two probes are made: TTTCCACTCATCCAG and TTTCCACTCATCCAA. 
The probes for the SNPs can either be to the sense or antisense strand of the DNA. 

The nucleotides at SNP TSC0069085, which is located on chromosome 21, can either be 
cytosine or thymidine. Two probes are made: TATCTTAATAC and TATCTTAATAT. 

Representative probes for these four SNPs are provided above. However, the probes can be 
10 longer or shorter in sequence and can be made to the sense or antisense strand of the DNA. In 

addition, more than one probe to each SNP can be used. Multiple probes with different sequences 
or the same sequences can be used for the same SNP. For example, one probe for SNP 
TSC0069085 can be of 10 nucleotides in length and another probe for SNP TSC0069085 can be of 
20 nucleotides in length. Alternatively, one probe for SNP TSC0069085 can be to the sense strand 
1 5 and a second probe for SNP TSC0069085 can be to the antisense strand. 

PGR Amplification 

Once a CodeLink Array is designed, the next step is amplification. Any method of 
20 amplification can be used, preferably PGR. Primers are designed to amplify two SNPs on 

chromosome 13 and two SNPs located on chromosome 21 using information provided at the SNP 
Consortium (http://siip.cshl.org) . The skilled artisan will understand how to design primers for any 
other SNP of interest. 

25 SNP TSC0466917 (C/G), which is located on chromosome 13, is amplified using the following 
primers: 

Upstream Primer: 

30 5' CCAGCTGGTAGAACTT 3' 

Downstream Primer: 

5' CCCAATAGACCTATAG 3' 
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SNP TSC1 172576 (T/A), which is located on chromosome 13, is amplified using the 
following primers: 

Upstream Primer: 
5 5' TAGCAGAATCTCTCAT 3' 

Downstream Primer: 

5' AGAGTATCTCATTTGTT 3' 

10 SNP TSC0271628 (A/G), which is located on chromosome 21, is amplified using the 

following primers: 

Upstream Primer: 

5' AGGAAATTGTGAAGTA 3' 

15 

Downstream Primer: 

5' TAACTCACTCACTATC 3' 

SNP TSC0069085 (C/T), which is located on chromosome 21, is amplified using the 
20 following primers: 

Upstream Primer: 

5* CTGCTGAGTCATAGTC 3' 

25 Downstream Primer: 

5' TGTTCTTTGAATCAAC 3' 

Oligonucleotide probes may be of any length including but not limited to 5-10, 1 1-15, 16- 
20, 21-25, 26-29, 30, 31-35, 36-45, 46-55, 56-65, 66-75, and greater than 75 bases in length. 

30 PCR conditions should be designed according to the manufacturer's suggestion. 

Representative PCR conditions are provided here. The loci of interest are amplified in separate 
reaction tubes but they can also be amplified together in a single PCR reaction. For increased 
specificity, a "hot-start" PCR is used. PCR reactions are performed using the HotStarTaq Master 
Mix Kit supplied by QIAGEN (catalog number 203443). The amount of template DNA and primer 

35 per reaction can be optimized for each locus of interest but in this example, 40 ng of template 
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human genomic DNA and 5 of each primer are used. Forty cycles of PCR are performed. The 
following PCR conditions are used: 

(1) 95°C for 15 minutes and 15 seconds; 

(2) 95°Cfor30 seconds; 
5 (4) 57°C for 30 seconds; 

(5) 72°C for 30 seconds; 

(6) Repeat steps 2-5 thirty two (32) times; 

(7) 72°C for 5 minutes. 

1 0 PCR Purification and Fragmentation 

The PCR products are purified following the procedures recommended by the makers of 
CodeLink. One mode of PCR purifications is provided here. PCR products are separated from the 
components of the PCR reaction using Qiagen's MinElute PCR Purification Kit following 
manufacturer's instructions (Catalog number 28006). 
1 5 The purified PCR products are fragmented following the procedures recommended by the 

makers of CodeLink. The PCR products can be fragmented using enzymes including but not 
limited to DNase or using mechanical forces, including but not limited to sonication or shearing 
through a needle. 

20 Hybridization 

The purified, fragmented PCR products are hybridized to the CodeLink Arrays following 
the procedures recommended by the makers of CodeLink. Typical hybridization procedures 
involve hybridizing the target probe and the sample DNA in a solution containing mild detergents, 
such as SDS or Triton-100, salts, such as MgCl 2 , and a buffer such as Tris or PBS. The 
25 hybridization reaction is performed usually with mild shaking at 25-44°C for 8-16 hours. The 
hybridization reactions can be performed using the Shaker Kit, available for Amersham 
Biosciences. 

Alternatively, hybridizations are performed using the Motorola 12-slide shaker tray, which 
contains hybridization chambers. The hybridization chamber ports are sealed with 1 cm sealing 
30 strips (Motorola Life Sciences) and the shaker trays containing the slides are loaded into the New 
Brunswick Innova™ 4080 shaking incubator, with the hybridization chambers facing up. Slides are 
incubated for 18 h at 37°C, while shaking at 3000 r.p.m. 

There may be specific procedures and solutions for hybridization with CodeLink Arrays. 
Representatives at CodeLink would not provide the protocols without prior purchase of an array. 

35 
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Allele Specific Extension 

After hybridization, an extension reaction is performed following the procedures 
recommended by the makers of CodeLink. A typical extension reaction requires the presence of 
nucleotides (dATP, dCTP, dGTP, dTTP), a polymerase, and a buffered solution containing salts. 
5 The concentration of nucleotides and the polymerase that is used should be as recommended by the 
makers of CodeLink. Any DNA polymerase can be used including but not limited to E. coli DNA 
polymerase, Klenow fragment of E. coli DNA polymerase I, T7 DNA polymerase, T4 DNA 
polymerase, T5 DNA polymerase, Klenow class polymerases, Taq polymerase, Pfu DNA 
polymerase, Vent polymerase, bacteriophage 29, REDTaq™ Genomic DNA polymerase, or 
10 sequenase. Preferably, the polymerase that is used is the one recommended by the makers of 
CodeLink. 

Wash, Label and Detect 

After the extension reaction, the arrays are washed, preferably with the solution 
15 recommended by the makers of CodeLink. A typical wash solution contains detergents) and salts 
in a buffered solution. For example, the arrays can be washed with a TNT buffer (0. 1 M Tris-HCl, 
pH 7.6, 0. 1 5 M NaCl, 0.05% Tween-20. The arrays are washed at the temperature recommended 
by the makers of CodeLink including but not limited to room temperature, 10-16°C, 17-24°C, 
25°C, 26-36°C, 37°C, 38-41°C, 42°C, 43-54°C, 55°C, or higher than 55°C. 
20 The signal is developed following the protocol recommended by the makers of CodeLink. 

A representative protocol is provided here. The signal is developed using a 1:500 dilution of a 
streptavidin-Alexa 647 (Molecular Probes), for 30 minutes at room termperature. Excess dye is 
removed by washing four times with TNT buffer, for 5 min each, at room temperature. The signal 
intensity can be increased by using a 1:200 dilution of tyramide-cy3 in amplification dilutent buffer 
25 (PE/NEN). 

Slides are rinsed with deionized water and dried using a nitrogen gun. Processed slides are 
scanned using the Axon GenePix Scanner with the laser set to 635 nm, the photomultiplier tube 
voltage to 600 and the scan resolution to 10|i. Slides are scanned and analyzed using CodeLink™ 
SNP Software kit, which is available from Amersham Biosciences. 
30 Each SNP is genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal 

DNA is homozygous, are analyzed with the DNA isolated from the plasma. 

Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
35 analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically is 



345 



WO 2004/078994 



PCT/US2004/006337 



present in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number of genomes, which can 
5 influence the ratio of one allele to another (see Example 1 1). 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
10 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
15 maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

i 

Multiplex PCR 

20 Regions upstream and downstream of the maternal homozygous SNPs are amplified from 

the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 

25 increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
of template DNA and primer per reaction is optimized for each locus of interest. In this example, 
30 the 20 jil of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
PCR reaction volume of 50 pi (20|d of template plasma DNA, 1 |xl of sterile water, 4 jil of primer 
mix, and 25 \xl of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
35 conditions are used: 
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(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 

(3) 4°Cfor30 seconds; 

(4) 37°C for 30 seconds; 

5 (5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 
The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
10 the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
aL, PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
aL, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et aL, Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
1 5 Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified (done to increase the copy number of the 
fetal loci of interest) contains annealing sequences for the primers that are used with the CodeLink 
assay. Upon purchase of the CodeLink array, each SNP and the primers used to amplify each SNP 
can be identified. With this knowledge, the multiplex primers are designed to encompass annealing 
20 regions for the primers in the HuSNP Array. 

Purification of Fragment of Interest 

The unused primers, and nucleotides are removed from the reaction by using Qiagen 
MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
25 following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 \i\ 
of sterile water. 5 \il of each amplified loci is mixed together 

CodeLink Assay, Washing, Staining and Scanning 

The pooled DNA is assayed with the CodeLink Array as described above. Washing, 
30 staining, and scanning procedures are as described above. 

Each SNP is genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal 
DNA is homozygous, and DNA isolated from the plasma is heterozygous are quantitated. 

Quantification 
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The intensity of the signal for each allele at a heterozygous is SNP is quantitated. As 
discussed above, the expected ratio of allele 1 to allele 2 can be used to determine the presence or 
absence of a chromosomal abnormality. If the maternal genome is homozygous at SNP X (A/A), 
and the plasma DNA is heterozygous at SNP X (A/G), then the G represents the distinct fetal 
5 signal. The ratio of G: A depends on the percentage of fetal DNA present in the maternal blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 

10 However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 

for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

15 This example explained the use of CodeLink Arrays, but it not intended to limit the use of 

arrays. Any DNA array may be used including but not limited to the DNA arrays listed in Table 
XXin, or DNA arrays available from any of the companies listed in Table XXTV. 

20 EXAMPLE 18 

Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 
template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
25 fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed using Illumina's BeadArray™ platform, 
available form Illumina in San Diego, CA. 

Collection of Blood Samples 

30 In accordance with an IRB approved study, blood samples are collected from pregnant 

women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
(catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 
formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 
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Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 
tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 
glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
5 crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO)3, BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, 
HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXHI. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 

10 preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 
reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 

15 concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamjne derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 

20 biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 

PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 



Isolation of Plasma and Maternal Cells 

25 The blood is stored at 4°C until processing. The tubes are spun at 1 000 rpm for ten minutes 

in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "bufily coat," which contains maternal 

30 cells, is placed into a separate tube and stored at -80°C. 

Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
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number 51 183). DNA is eluted in 100 \il of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat." 

Identification of Homozygous Maternal SNPs 
5 Illumina's BeadArray™ technology consists of a fiber optic based array system that 

allegedly allows miniaturized, very-high throughput genetic analysis. Illumina's 96-bundle Sentrix 
Array™ allegedly enable parallel processing of nearly 150 3 000 SNPs. 

Fiber bundles are manufactured to contain nearly 50,000 individual, light transmitting fiber 
strands. Each fiber bundle is converted into an array by first chemically etching a microscopic well 
10 at the end of each fiber strand within a bundle, which creates up to 50, 000 discrete microscopic 
wells per bundle. 

In a separate process, sensors are created by affixing a specific type of molecule to the 
beads, each bead approximately 3 microns in diameter. For SNP analysis, a particular DNA 
sequence is attached to each bead in a batch. Illumina states that hundreds of thousands of 
15 molecules of the same type coat each bead. Batches of coated beads are combined to form a pool 
specific to the type of array desired. For SNP analysis, the array pool allegedly uses DNA 
sequences that do not cross hybridize with themselves or with known genomic DNA. 

Next, the self-assembled array is created. By dipping bundles into a pre-mixed bead pool, 
the coated beads self-assemble individually, one bead per well, on the end of each fiber in the 
20 bundle to create the array. In Illumina's SNP genotyping array, the bead pool consists of up to 
1500 sequences, which self assemble in each bundle of 50,000 fibers to create an array with an 
approximately thirty-fold redundancy. 

The BeadArray bundles are assembled into a matrixed device, which is called the Array of 
Arrays™ platform, where each fiber bundle of the larger array matches a well of a standardized 
25 microtiter plate. 

Following array assembly, a decoding process is used to determine the bead type that 
resides in each fiber core. The DNA molecules are synthesized using the Oligator™ custom DNA 
synthesis technology. 

Illumina's SNP genotyping service using the BeadArray technology and other technologies 
30 that have sprung from the BeadArray technology are provided at Illumina's facilities or at facilities 
that have received a license to the BeadArray technology. 

Maternal DNA samples are analyzed using BeadArrays that contain oligonucleotide probes 
to SNPs. The oligonucleotide probes can be for SNPs located on any chromosome including 
human chromosome 1,2, 3,4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, X, and 
35 Y. The BeadArrays are analyzed to identify SNPs, wherein the maternal template DNA is 
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homozygous. The identified homozygous SNPs are then analyzed using the DNA isolated from the 
maternal plasma. 

Analysis of DNA Isolated from Maternal Plasma 

5 After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 

analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
exists in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
1 0 statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 1 1). 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
15 but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
20 provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
maternal homozygous SNPs. The primers are designed to terminate at the V end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

25 Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
30 together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
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of template DNA and primer per reaction is optimized for each locus of interest In this example, 
the 20 \xl of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
5 PCR reaction volume of 50 \il (20pl of template plasma DNA, 1 \il of sterile water, 4 |al of primer 
mix, and 25 jil of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
conditions are used: 

(1) 95°C for 15 minutes; 

(2) 95°C for 30 second; 
1 0 (3) 4°C for 30 seconds; 

(4) 37°C for 30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
1 5 trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
al., PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
aL, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
20 bacteriophage 29, which undergoes rolling circle replication (Dean et aL, Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified contains annealing sequences for the 
oligonucleotide probes in the BeadArray. Upon purchase of the BeadArray service, each SNP and 
25 the primers used to analyze each SNP are identified. With this knowledge, the multiplex primers 
are designed to encompass annealing regions for the primers in the BeadArray. 

Purification of Fragment of Interest 

The unused primers, and nucleotides are removed from the reaction by using Qiagen 
30 MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 jliI 
of sterile water. 5 |il of each amplified loci is mixed together 

BeadArray Technology 
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The pooled DNA is assayed with the BeadArray as described above. Each SNP is 
genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
and DNA isolated from the plasma is heterozygous are quantitated. 

5 Quantification 

The intensity of the signal for each allele at a heterozygous is SNP is quantitated. As 
discussed above, the expected ratio of allele 1 to allele 2 can be used to determine the presence or 
absence of a chromosomal abnormality. If the maternal genome is homozygous at SNP X (A/A), 
and the plasma DNA is heterozygous at SNP X (A/G), then the G represents the distinct fetal 

10 signal. The ratio of G:A depends on the percentage of fetal DNA present in the maternal blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 

1 5 However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 

for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

20 This example explained the use of Illumina's BeadArray Technology, but it not intended to 

limit the use of arrays. Any DNA array may be used including but not limited to the DNA arrays 
listed in Table XXEQ, or DNA arrays available from any of the companies listed in Table XXIV. 

EXAMPLE 19 

25 Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 

template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed using Sequenom's MassArray™ System, 

30 which uses Sequenom's homogenous MassCleave™ (hMC) method. 

Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 
women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
35 (catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 
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formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 
tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 
5 glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfliydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DIME, DTSSP, EGS, 

10 HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXm. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 
preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 

1 5 reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 

20 citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

25 

Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
30 ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 
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Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 5 1 1 83). DNA is eluted in 1 00 \i\ of distilled water. The Qiagen Midi Kit also is used to 
5 isolate DNA from the maternal cells contained in the "buffy coat." 

Identification of Homozygous Maternal SNPs 

Targeted SNP Discovery: hMC Method 
10 Sequenom's hMC method uses nucleotide base-specific cleavage for genotyping. The cleaved 
fragments are measured using MALDI-TOF to generate a characteristic peak signal, based on the 
mass of each fragment, for any particular sequence. 



Primer Design 

15 Four primers are needed for the two PCR reactions (one forward reaction and one reverse 

reaction). The recommended size range for PCR amlicons is 300-700 base pairs. The primers 
contain a T-7 promoter tagged forward or reverse primer to obtain an appropriate product for in 
vitro transcription. An 8 base insert is included to prevent abortive cycling. The primer that lacks 
the T-7 promoter contains a 10-mer tag in order to balance the primers. 

20 The primers for one SNP are provided below. SNP TSC1 172576 (T/A), which is located 

on chromosome 13, is amplified using the following primers for the forward reaction: 



Forward reaction: 

Upstream Primer: 

25 5' CAGTAATACGACTCACTATAGG GGTCAGGAT TAGCAGAATCTCTCAT 3 ' 

Downstream Primer: 
5' GCATTCTATGA GAGTATCTCATTTGTT 3' 

30 Reverse Reaction: 

Upstream Primer 

5' CAGTAATACGACTCACTATAGG GGTCAGGAA GAGTATCTCATTTGTT 3' 

Downstream Primer 
35 5' GCATTCTATG TAGCAGAATCTCTCAT 3' 
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The sequence of the T-7 promoter are in italics, the 8 base insert is underlined, the 10- base 
balancing sequence is double underlined, and the gene specific sequences are unmodified. 

5 PCR Amplification 

Five nanograms of DNA is amplified in a 5 \il volume using a 384-microtiter format. The 
following PCR conditions are used: 

1) 94°C for 15 minutes; 

2) 94°C for 20 seconds; 
10 3) 62°C for 30 seconds; 

4) 72°C for 1 minute; 

5) Repeat steps 2-4 44 times; and 

6) 72°C for 3 minutes. 

15 Dephosphorylation 

Shrimp Alkaline Phosphatase (SAP) (2 is added to each 5 pi PCR reaction to 
dephosphorylate unincorporated dNTPs from the PCR reaction. The plates are incubated at 37°C 
for 20 minutes. Then, the plates are incubated at 85°C for 5 minutes. 

20 In Vitro Transcription 

For each transcription reaction, 2 jxl of transcription cocktail and 2 \x\ of PCR/SAP sample 
are needed. Add 2 |xl of transcription cocktail and 2 |xl of PCR/SAP sample to a new microtiter 
plate. The plates are incubated at 37°C for two hours. For detailed information regarding these 
protocols see the "Processing homogeneous MassCLEAVE Reactions" chapter in the MassARRAY 
25 Liquid Handler SNP Discovery User's Guide for instructions, which is fully incorporated herein by 
reference. 

RNase A Cleavage 

RNase A cocktail (2.5 yl) is added to each reaction (T cleavage and C cleavage). The 
30 plates are incubated at 37°C for one hour. 

Depending on the nucleotide at the SNP site, various fragments of different weights are 
generated. For example, the DNA sequence surrounding SNP TSC1 172576, which is located on 
chromosome 13, is as follows: 

35 5' CCGCATA T/A CTCAGCACA 3' 
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3'GGCGTAT A/T GAGTCGTGT 5' 



After PCR, in vitro transcription, and base-specific cleavage, the following fragments for 
each allele are generated: 



T allele 



A allele 



Products of forward 1) TATCTCA 
Transcription 2) ATC 



1) CTCAGC 

2) AAC 



Products of reverse 
transcription 



1) AGTTA 

2) GTTATGC 



1) AGATA 

2) ATATGCG 



For the ATC and AAC fragments, the weight difference between T and A is used to 
determine the genotype at SNP TSC1 172576. Likewise, the weight difference between T and A in 
fragments AGTTA and AGATA is used to determine the genotype at SNP TSC1 172576. 

Sample Conditioning 

Double distilled water (20 is added to each sample within the 384-well plate. Clean 
Resin (6 mg) is added to each well. The plate is rotated for 10 minutes, followed by a- 
centrifugation at 3200 X g. It is recommended that water always be added before the Clean Resin. 

Sample Transfer 

The hMC reaction product (10-15^1) is dispensed onto a 384 element SpectroCHIP®. For 
further information, see the "Dispensing MassCLEAVE Reaction Products onto SpectroCfflPs" 
chapter in the MassARRAY Nanodispenser SNP Discovery User's Guide for instructions. 

Sample Analysis 

Spectra from the four cleavage reactions is acquired using the MassARRAY™ system. For 
further instructions, see the "Acquiring Spectra" chapter in the MassARRAY Discovery RT 
Software User's guide for instructions on acquiring spectra from SpectroCHPS®. 

SNP Analysis 

The results are analyzed using the SNP Discovery Analysis software. For further 
instructions, see the "Analyzing SNPs" chapter in the MassARRAY Discovery RT Software User's 
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Guide for instructions on using the SNP Discovery Analysis software. Components that are useful 
for the MassARRAY procedure include MassARRAY™ Analyzer (part number 004500), 
MassARRAY™ Discovery RT Software version 1.2 (part number 1 1434), MassARRAY™ SNP 
Discovery Starter Kit (part number 10027), and Liquid Handler SNP Discovery Methods and 
5 Macros (part number 1 1433). 

The SpectroCHIP array is used to genotype the maternal DNA following the 
manufacturer's recommended protocols and procedures, which are made available after purchase of 
the SpectroCHIP array. SNPs at which the maternal DNA is homozygous are used to analyze the 
DNA isolated from the maternal plasma. 

10 

Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
exists in the maternal plasma. To increase the copy number of the loci of interest, which are the 
1 5 SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 11). 

20 Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
but not limited to 2, 3, 4, 5, 6, 7, 3, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 

25 the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 

30 any of the four nucleotides and in any combination of the four nucleotides. 

Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
the template genomic DNA using the polymerase chain reaction (PCR, U.£. Patent Nos. 4,683,195 \ 
35 and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 

358 



t 



WO 2004/078994 PCT/US2004/006337 

approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

5 For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 

using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
of template DNA and primer per reaction is optimized for each locus of interest. In this example, 
the 20 |il of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 

10 into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
PCR reaction volume of 50 \il (20^1 of template plasma DNA, 1 jlxI of sterile water, 4 ^1 of primer 
mix, and 25 jal of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
conditions are used: 

(1) 95°C for 15 minutes; 

15 (2) 95°C for 30 second; 

(3) 4°C for 30 seconds; 

(4) 37°C for 30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

20 The temperatures and times for denaturing, annealing, and extension, are optimized by 

trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
al 9 PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 

25 al, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et al 9 Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified contains annealing sequences for the PCR 

30 primers in the targeted SNP discovery, hMC method 

Purification of Fragment of Interest 

The unused primers, and nucleotides are removed from the reaction by using Qiagen 
MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
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following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 \il 
of sterile water. 5 (il of each amplified loci is mixed together 

Targeted SNP Discovery: hMC Method 

5 The pooled DNA is assayed with the hMC method as described above. Each SNP is 

genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
and DNA isolated from the plasma is heterozygous are quantitated. However, SNPs located on 
other chromosomes can also be quantitated if so desired. 

10 Quantification 

The intensity of each peak, wherein each peak corresponds to a DNA fragment with a 
specific molecular weight, is quantitated. As discussed above, the expected ratio of allele 1 to allele 
2 is used to determine the presence or absence of a chromosomal abnormality. If the maternal 
genome is homozygous at SNP X (A/A), and the plasma DNA is heterozygous at SNP X (A/G), 
1 5 then the G represents the distinct fetal signal. 

There will be some fragments that differ in molecular due to the presence of the G 
nucleotide at SNP X in the fetal genome. The intensity of the peak with the A nucleotide is 
quantitated and the intensity of the peak that corresponds to fragments with the G nucleotide is 
quantitated. The ratio of G:A depends on the percentage of fetal DNA present in the maternal 
20 blood, 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 2 1 . 

25 However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 

for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0,25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

30 This example explained the use of Sequenom's hMC method, but it not intended to limit 

the use of other mass spectrometry techniques. 

EXAMPLE 20 
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Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 
template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
5 and plasma DNA, In this example, SNPs are analyzed using Sequenom's Mass Array™ 
Homogenous MassEXTEND™ (hME) Assay. 

Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 

10 women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
(catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 
formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 

15 tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 

glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 

20 DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, 
HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXm. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 
preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

25 An agent that stabilizes cell membranes may be added to the maternal blood sample to 

reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 

30 amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 

citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 

35 hyaluronate, or polaxamer 188. 
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Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
5 ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 5 1 1 83). DNA is eluted in 1 00 \il of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat." 

Identification of Homozygous Maternal SNPs 

MassARRAY Homogenous MassEXTEND™ (hME) Assay 
The Homogenous MassEXTEND™ (hME) Assay uses a beadless, label-free primer 
20 extension chemistry for genotyping. Each of the primer products has a unique molecular weight 
that allows the associated genotype to be precisely identified using mass spectrometry. 



10 



15 



Template Amplification 

The isolated maternal DNA is amplified (2.5 ng) in a 5 \sl volume using a 384-microtiter 
25 plate format. Any number of SNPs can be amplified, either in a single reaction or in multiple 
reactions. Representative primers that are used to amplify SNP TSC0271628 (A/G), which is 
located on chromosome 21, are provided below: 

Upstream Primer: 
30 5* AGGAAATTGTGAAGTA 3' 



Downstream Primer: 

5' TAACTCACTCACTATC 3' 
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The primers can be longer or shorter in nucleotide sequence. PCR conditions recommended by the 
makers of MassARRAY Homogenous MassEXTEND Assay are followed. Representative PCR 
conditions are provided below: 

5 (1) 95°C for 15 minutes and 15 seconds; 

(2) 95°C for 30 seconds; 

(4) 57°C for 30 seconds; 

(5) 72°C for 30 seconds; 

(6) Repeat steps 2-5 thirty two (32) times; 
10 (7) 72°C for 5 minutes. 



Dephosphorylation 

Arctic shrimp alkaline phosphatase is added to the samples, which are then incubated at 
37°C for 20 minutes. This step is done to dephosphoiylate any remaining nucleotides, which 
1 5 prevents their future incorporation and interference with MassARRAY Homogenous 

MassEXTEND Assay. Samples are then incubated at 85°C to inactivate the heat-labile SAP. 



hME Reaction 

A MassEXTEND primer is designed to anneal close to the polymorphic site, and is 
20 designed to identify both alleles of the polymorphic site. For SNP TSC0271 628, a representative 
MassEXTEND primer is: 



5' CTTTTTATGCCTTTCCACTCATCCA 3' 



25 The length of the MassEXTEND primer is designed according to the instructions provided by the 
makers of the MassARRAY Homogenous MassEXTEND Assay. 

The MAssEXTEND primer, DNA polymerase, and a cocktail mixture of deoxynucleotides 
(dNTPs) and dideoxynucleotides (ddNTPs) are added to the initial primer extension reaction. 
Allele-specific primer products are generated that are generally one to four bases longer than the 

30 original MassEXTEND primer. 

A MASSEXTEND primer is hybridized closely adjacent to the polymorphic site following 
the conditions.recommended by the makers of the MassARRAY Homogenous MassEXTEND 
Assay. Nucleotide mixtures are selected to maximize mass differences for all possible 
MassEXTEND products. Appropriate dNTPS are incorporated until a single ddNTP is 
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incorporated, and the reaction is terminated. The manufacturer's protocols are followed for all 
steps of the hME assay. 

Representative reaction products for SNP TSC0271628 are provided below: 

A allele Before Primer Extension 

MassEXTEND primer: CT TTTT ATGCCT T TCCACTCATCCA 

Sample DNA: GAAAAAT ACGGAAAGGTGAGTAGGTTTCC 

The SNP site is identified in bold. After incubation with DNA polymerase, ddATP, dCTP, dGTP, 
and dTTP, the following product is generated: 

A allele After Primer Extension 

MassEXTEND primer: CT TTTT ATGCCT T TCCACTCATCCAA* 

Sample DNA: GAAAAATACGGAAAGGTGAGTAGGTTTCC 

ddATP is incorporated into the primer. Either labeled or unlabeled ddNTPs can be used. The 
asterisk indicates ddATP that is unlabeled. After the incorporation reaction, a 24-mer primer is 
generated. 

G allele Before Primer Extension 

MassEXTEND primer: CT TTTT ATGCCT T TCCACTCATCCA 

Sample DNA: GAAAAATACGGAAAGGTGAGTAGGTCTCC 

The SNP site is identified in bold. After incubation with DNA polymerase, ddATP, dCTP, dGTP, 
and dTTP, the following product is generated: 

G allele After Primer Extension 

MassEXTEND primer: CT TTTT ATGCCT T TCCACTCATCCA,GA* 

Sample DNA: GAAAAATACGGAAAGGTGAGTAGGTCTCC 
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After the incorporation reaction, a 25-mer primer is generated. The difference in molecular weight 
between the reaction product for the A allele (24-mer) and the reaction product for the G allele (25- 
mer) is used to genotype the locus of interest. 

5 Sample Conditioning 

SpectroCLEAN™ resin is added to the reaction to remove extraneous salts that interfere 
with MALDI-TOF analysis. 

Sample Transfer 

10 15 nl of sample is transferred from the 384-microtiter plate and spotted onto the pad of the 

384 SpectroCHIP™ microarray. 

i 

Sample Analysis 

The SpectroCHIP™ is placed into the MALDI-TOF, which measures the mass of the 
15 extension products. Once determined, the genotype is called in real-time with SpectroTYPER™ RT 
software. SNPs at which the maternal DNA are homozygous are identified, and analyzed with the 
DNA that is isolated from the plasma. 

Analysis of DNA Isolated from Maternal Plasma 

20 After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 

analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
exists in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest This is done to reduce 

25 statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 1 1). 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
30 but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
35 provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 

365 



WO 2004/078994 



PCT/US2004/006337 



maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

5 Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
10 together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
15 of template DNA and primer per reaction is optimized for each locus of interest. In this example, 
the 20 jal of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
PCR reaction volume of 50 jil (20jll1 of template plasma DNA, 1 \xl of sterile water, 4 \xl of primer 
20 mix, and 25 (il of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
conditions are used: 

(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 

(3) 4°C for 30 seconds; 
25 (4) 37°C for 30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 
30 Other methods of genomic amplification can also be used to increase the copy number of 

the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
al 9 PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
al, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et al., Genomic Research 
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11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

Purification of Fragment of Interest 

5 The unused primers, and nucleotides are removed from the reaction by using Qiagen 

MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 jul 
of sterile water. 5 jil of each amplified loci is mixed together 

1 0 Mass ARRAY Homogenous MassEXTEND Assay 

The pooled DNA is assayed with the hME assay as described above. Each SNP is 
genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
and DNA isolated from the plasma is heterozygous are quantitated. 

15 Quantification 

The intensity of each peak, wherein each peak corresponds to a DNA fragment with a 
specific molecular weight, is quantitated. As discussed above, the expected ratio of allele 1 to allele 
2 is used to determine the presence or absence of a chromosomal abnormality. If the maternal 
genome is homozygous at SNP X (A/A), and the plasma DNA is heterozygous at SNP X (A/G), 

20 then the G represents the distinct fetal signal. 

There will be some fragments that differ in molecular due to the presence of the G 
nucleotide at SNP X in the fetal genome. The intensity of the peak with the A nucleotide is 
quantitated and the intensity of the peak that corresponds to fragments with the G nucleotide is 
quantitated. The ratio of G:A depends on the percentage of fetal DNA present in the maternal 

25 blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 
30 However, if the fetal genome contains an additional copy of chromosome 2 1 , then the ratio 

for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 
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This example explained the use Sequenom's MassARRAY Homogenous MassEXTEND 
(hME) 'assay, but it not intended to limit the use of techniques that differentiate molecules based on 
molecular weight. 

5 EXAMPLE 21 

Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 
template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
10 fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed using Orchid's SNP-IT™ Assay. However, 
other SNP detection methods based on primer extension may also be used. 



Collection of Blood Samples 

1 5 In accordance with an 1KB approved study, blood samples are collected from pregnant 

women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
(catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 
formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

20 Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 

tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 
glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 

25 AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, 
HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXm. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 
preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 

30 concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 
reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 

35 taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
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amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 
citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
5 PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
10 in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

15 

Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 51 183). DNA is eluted in 100 of distilled water. The Qiagen Midi Kit also is used to 
20 isolate DNA from the maternal cells contained in the "buffy coat." 

Identification of Homozygous Maternal SNPs 

SNP-TT™ Assay 

25 The SNP-IT Assay is based on a single base primer extension. Prior to the SNP-IT Assay, 

a PCR product that includes the SNP of interest is prepared, using one unmodified and one 

i 

phosphorothiolate modified primer. The PCR product is then rendered single stranded using 
exonuclease, and the single stranded DNA is annealed to a SNP-IT oligonucleotide immobilized on 
the surface of a 96-well microtiter plate. Following hybridization, single base extension occurs by 
30 the addition of DNA polymerase and the labeled terminators. The incorporated base is detected 
using antibodies specific to the label followed by colorimetric detection. Data analysis can be done 
visually or with the use of the absorbance plate reader. 

Primer Design 
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For each locus of interest, the SNP-IT™ Assay requires three primers. The primers are 
designed to produce an amplicon of 100-50 base pairs. The sequence of the SNP-IT primer, which 
is designed to anneal immediately upstream of the SNP site, is the best sequence available from 
between the upper and the lower strands. The sequence qf the SNP-IT primer is designed to 
minimize hybridization to self and other sites in the amplicon. In addition, the SNP-IT primer may 
contain modified bases to prevent self-priming. The length of the primers is designed according to 
the makers of the SNP-IT Assay. 

Representative primers for the amplification and genotyping of SNP TSC0069085, which is 
located on chromosome 21, are provided below: 

Upstream Primer: 

5' ATCACACTGGGGATC 3' 

Downstream Primer: 

5' CTAAACCTATGACTC 3 ' 

SNP-IT primer 

5' TTCACAGAGGATATCTTAATA 3' 

The upstream primer is unmodified and the downstream primer is phosphorothiolate 
modified. 

SNP-IT Plate Coating 

SNP-IT primer is added to coat wells of empty 96-well plates. This reaction typically 
incubates overnight The manufacturer's protocols and procedures are followed. 

PCR 

Template DNA (1 5 ng) is amplified either in a reaction vessel including but not limited to 
an eppendorf tube or a well of a microtiter plate. The manufacturer's protocols and procedures are 
followed for the PCR reaction. 

Exonuclease 



370 



WO 2004/078994 



PCT/US2004/006337 



PCR product is treated with exonuclease to degrade the unmodified strand. The protected 
phosphorothiolate-labeled strand is used in the SNP-IT Assay. The manufacturer's protocols and 
procedures are followed for the exonuclease reaction. 

Annealing 

Single stranded PCR product is transferred to SNP-IT plate and is allowed to form a hybrid 
with the SNP-IT primer. The annealing reaction typically proceeds for one hour. The 
manufacturer's protocols and procedures are followed for the annealing reaction. 

SNP-IT Reaction 

The extension reagent, which contains DNA polymerase, two terminating nucleotides 
labeled with either fluorescein or biotin and two unlabeled terminators, is added to the SNP-IT well 
containing the annealed template and primer complex. For SNP TSC0069085, ddCTP is labeled 
with fluorescein and ddTTP is labeled with biotin, and the unlabeled terminators are ddATP, and 
ddGTP. The manufacturer's protocols and procedures are followed for the extension reaction. 

The SNP specific base is incorporated by single base extension of the SNP-IT primer. 
Primers are washed manually or in a plate washer to remove unincorporated material. The 
manufacturer's protocols and procedures are followed for the washing reaction. 

Detection 

Anti-fluorescein labeled with alkaline phosphatase (AP) is added to the plate and allowed to 
bind to any incorporated fluorescein labeled terminator. The manufacturer's protocols and 
procedures for the labeling reaction are followed. 

The plates are washed, and then color development is performed using pNPP as the 
detection substrate. The absorbance is read at 405 nm to detect yellow colored pNPP substrate 
followed by a washing step to remove pNPP detection reagents. The manufacturer's protocols and 
procedures are followed for color development and washing steps. 

Streptavidin labeled with horse radish peroxidase (HRP) is added to the plate and allowed 
to bind to any incorporated biotin labeled terminator. The manufacturer's protocols and procedures 
are followed for the labeling reaction. 

Following washing, color development is performed using TMB as the detection substrate. 
The absorbance is read at 620 nm to detect blue colored TMB substrate. 

Analysis 
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Absorbance is plotted to generate a scather plot from which genotype calls are made. SNPs 
at which the maternal DNA is homozygous are identified, and analyzed with the DNA isolated from 
the maternal plasma. 

5 Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
exists in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
10 130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 11). 

Design of Multiplex Primers 

15 The primers are 12 bases in length. However, primers of any length can be used including 

but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
116-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

20 The maternal homozygous SNPs vary from sample to sample so defined sequences are not 

provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

25 

Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
30 approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
35 using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
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of template DNA and primer per reaction is optimized for each locus of interest. In this example, 
the 20 ^1 of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
5 PCR reaction volume of 50 \il (20jal of template plasma DNA, 1 jil of sterile water, 4 of primer 
mix, and 25 fil of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
conditions are used: 

(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 
10 (3) 4°C for 30 seconds; 

(4) 37°Cfor 30 seconds; 

(5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
15 trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
al, PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
al 9 Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
20 bacteriophage 29, which undergoes rolling circle replication (Dean et aJ. 3 Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
Genome Amplification kits, and Tagged PCR. 

Purification of Fragment of Interest 
25 The unused primers, and nucleotides are removed from the reaction by using Qiagen 

MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 jil 
of sterile water. 5 jil of each amplified loci is mixed together 

30 SNP-JT Assay 

The pooled DNA is assayed with the SNP-IT assay as described above. Each SNP is 
genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
and DNA isolated from the plasma is heterozygous are quantitated. 

35 Quantification 
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The fluorescence intensity of each allele is quantitated. As discussed above, the expected 
ratio of allele 1 to allele 2 is used to determine the presence or absence of a chromosomal 
abnormality. If the maternal genome is homozygous at SNP X (A/A), and the plasma DNA is 
heterozygous at SNP X (A/G), then the G represents the distinct fetal signal. 
5 The intensity of the allele with the A nucleotide is quantitated and the intensity of the allele 

with the G nucleotide is quantitated. The ratio of G:A depends on the percentage of fetal DNA 
present in the maternal blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
10 chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 

However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 
for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
15 maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

In this example, the terminator nucleotides are labeled with different chemical moieties. 
However, using the methods described in this application (see Example 6), the SNP-IT assay could 
be modified to allow detection of both alleles with a single labeled terminator. 
20 This example explained the use of Orchid's SNP-IT assay, but it not intended to limit the 

use of other techniques that rely on primer extension. Orchid's SNPstream 25K, as well 
accompanying software including but not limited to GetGenos™, QCreview™, and ValidGenos™, 
can also be used to detect the presence of chromosomal abnormalities in the maternal blood. 
Additional information about these products can be found at: 
25 http://www.orchidbio.com/products/lsfi/products/snpstream.asp . 

EXAMPLE 22 

30 Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 

template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 
Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed using the TaqMan® assay. However, other 

35 methods that rely on fluorogenic 5' nuclease assay can be used. 
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Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 
women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 
5 (catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 

formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 
tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 

1 0 glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO)3, BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, 

15 HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXm. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 
preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood sample to 

20 reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 

25 citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

30 

Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpm for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1 000 rpm for 
ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 

375 



WO 2004/078994 



PCT/US2004/006337 



3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "bufify coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

5 Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 51 183). DNA is eluted in 100 jal of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat" 

10 

Identification of Homozygous Maternal SNPs 
TaqMan Assay 

PE Biosystems has two instruments in its Sequence Detection Systems product line, the 
ABI Prism® 7700 Sequence Detection System and the GeneAmp® 5700 Sequence Detection 

1 5 System. These real-time systems allegedly are capable of detecting PCR products as they 
accumulate during PCR and so enable the quantitation of DNA in the sample. 

One chemistry available for use on the ABI PRISM® 7700 and GeneAmp® 5700 detection 
systems is the fluorogenic 5' nuclease assay, or the TaqMan® assay, which uses a fluorogenic 
probe to enable the detection of a specific PCR product as it accumulates during PCR. PE 

20 Biosystems' patented fluorogenic probe design that incorporates the reporter due on the 5' end and 
the quencher on the 3 9 end has assisted with the design of TaqMan probes. 

The basis for PCR quantitation in the ABI 7700 instrument is to continuously measure PCR 
product accumulation using a dual-labeled flourogenic oligonucleotide probe called a TaqMan® 
probe, which is composed of a short (20-25 bases) oligodeoxynucleotide that is labeled with two 

25 different fluorescent dyes. On the 5' terminus is a reporter dye and on the 3 ' terminus is a 

quenching dye. This oligonucleotide probe sequence is homologous to an internal sequence present 
in the PCR amplicon. When the probe is intact, energy transfer occurs between the two fluorophors 
and emission from the reporter is quenched by the quencher (Livak et al. 9 PCR Methods and 
Applications, 4:357-362, 1995a; U.S. Patent No. 5,538,848; U.S. Patent No. 5,723,591). 

30 During the extension phase of PCR, the probe is cleaved by 5' nuclease activity of Taq 

polymerase thereby releasing the reporter from the oligonucleotide-quencher and producing an 
increase in reporter emission intensity. The ABI Prism 7700 uses fiber optic systems which 
connect to each well in a 96-well PCR tray format. The laser light source excites each well and a 
CCD camera measures the fluorescence spectrum and intensity from each well to generate real-time 

35 data during PCR amplification. The ABI 7700 Prism software examines the fluorescence intensity 
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of reporter and quencher dyes and calculates the increase in normalized reporter emission intensity 
over the course of the amplification. The results are then plotted versus time, represented by cycle 
number, to produce a continuous measure of PGR amplification. To provide precise quantification 
of initial target in each PCR reaction, the amplification plot is examined at a point during the early 
5 log phase of product accumulation. This is accomplished by assigning a fluorescence threshold 
above background and determining the time point at which each sample's amplification plot reaches 
the threshold (defined as the threshold cycle number or CT). Differences in threshold cycle number 
are used to quantify the relative amount of PCR target contained within each tube as described 
previously. 

10 For SNP analysis, a TaqMan probe can be designed for each allele of the SNP. The 

reporter emission is used to determine the presence or absence of each allele at the SNP. For 
example, for a SNP that can either be adenine or guanine, a TaqMan probe will be designed with a 
complementary nucleotide to the adenine and a separate TaqMan probe will be designed with a 
complementary nucleotide to the guanine. The two TaqMan probes can be used in separate reaction 

15 vessels, which allows the amount of the adenine allele and the amount of the guanine allele to be 
calculated. 



Primer and Probe Design 

Primer and probes can be designed using the Primer Express® software. The probe is 
20 designed first, and then the primers are designed as close as possible to the probe without 
overlapping it. Amplicons of 50-150 base pairs are strongly recommended. 

The primer and probes should be designed following the manufacturer's recommendations. 
For both the primer and the probes, the G/C is in the range of 20-80%. The primer and probes are 
designed to avoid runs of an identical nucleotide. This is especially true for guanine, where runs of 
25 four or more Gs should be avoided. 

For the probe, the TM is about 68-70°C, and is designed so that there is no guanine on the 
5' end. Also, the probe is designed so that there are more C than G bases. 

For the primers, the TM is about 58-60°C, and the primers are designed so that the five 
nucleotides at the 3 ' end have no more than 2 G and/or C bases. 
30 For example, representative primers and probes for SNP TSC0271628 (A/G), which is 

located on chromosome 21, are provided below: 



Forward Primer (T M of 60°C) 
35 5' AGTCTTGTAATACGACAGTCTT 3' 
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Reverse Primer (T M of 58°C) 

5' CCATATCAATCAGTACTCTTG 3' 

TaqMan Probe A allele (T M of 68°Q bold indicates variable nucleotide at SNP) 

5' CCTTTCCACTCATCCAAAGGTTG 3' 

1 0 TaqMan Probe G allele (T M of 70°C; bold indicates variable nucleotide at SNP) 

5' CCTTTCCACTCATCCAGAGGTTG 3' 

Information regarding the sequence surrounding SNPs is found at: http://www.snp.cshl.org . By 
15 independently varying forward and reverse primer concentrations, the concentrations that provide 
optimal assay conditions can be identified. Primer concentration ranges of 50 nM - 900 nM are 
tested. 

If the maternal DNA is homozygous for one allele, for example, adenine, then in the sample 
that contains the TaqMan probe specific for the guanine nucleotide, the reporter is not separated 
20 from the quencher because the TaqMan probe does not anneal to the template DNA. However, if 
the maternal DNA is homozygous, then the reporter will be separated from the quencher in both 
samples containing the TaqMan probe specific for the guanine allele and samples containing the 
TaqMan probe specific for the adenine allele. 



25 Reagent Solution 

The polymerase recommended for the TaqMan Assay is AmpliTaq Gold DNA polymerase. 
It is thought that the use of AmpliTaq Gold DNA polymerase reduces the amount of non-specific 
product formation. The incorporation of AmpErase® Uracil n-glycosylase (UNG) and dUTP 
provide protection against PCR carryover contamination. For PCR reactions, the TaqMan 
30 Universal PCR Master mix, which is a reagent designed to provide optimal performance for 
TaqMan assays, is recommended by the manufacturer. 

The TaqMan reaction buffer contains 5.5 mM MgC12, 200 nM each of dATP, dCTP, 
dGTP, 400 nM dUTP, 0.5 U of uracyl DNA glycosylase, and 1.25 U of AmpliTaq gold. 



3 5 Thermal Cycling Parameters 
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PCR amplification and detection for all primer-probe combinations are performed with the 
ABI 7700 Sequence Detection System. The recommended cycling parameters for the TaqMan 
assay are provided below: 

1) 50°Cfor2min; 

2) 95°Cfor lOmin; 

3) 95°C for 15 sec; 

4) 60°C for 1 min; 

5) Repeat steps 3-4 for 40 cycles. 



TaqMan Quantitation 

External standards are generated from know quantities of DNA containing an adenine 
nucleotide at SNP TSC0271628 and a guanine nucleotide at SNP TSC0271628, spanning 6 orders 
of magnitude (from 5 x 10° to 5 x 10 6 copies). The detection threshold is set at 10 times the 
15 standard deviation of the mean baseline emission calculated for PCR cycles 3 to 15 (Stults et al 9 
Applied and Environmental Microbiology, Vol. 67, No. 6, 2781-2789, 2001). Standard curves 
relating the threshold cycle to DNA concentrations are generated with the ABI Prism 7700 software 
(available from Perkin Elmer). 

SNPs at which the maternal DNA is homozygous are identified, and are analyzed with the 
20 DNA isolated from the plasma sample. 



Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
25 exists in the maternal plasma. To increase the copy number of the loci of interest, which are the 
SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 130 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 11). 

30 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-8.5, 86-95, 96-105, 106-115, 
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1 16-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
5 maternal homozygous SNPs. The primers are designed to terminate at the 3* end in the dinucleotide 
"AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

Multiplex PCR 

10 Regions upstream and downstream of the maternal homozygous SNPs are amplified from 

the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 

15 increase the number of copies of the loci of interest, which eliminates error generated from a low 
number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
of template DNA and primer per reaction is optimized for each locus of interest In this example, 
20 the 20 pi of plasma template DNA is used. 

Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 
into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
PCR reaction volume of 50 ^1 (20|il of template plasma DNA, 1 \il of sterile water, 4 fil of primer 
mix, and 25 ^1 of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
25 conditions are used: 

(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 

(3) 4°Cfor30 seconds; 

(4) 37°C for 30 seconds; 

30 (5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°C for 10 minutes. 
The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
35 the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
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a/., PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
al, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et al> Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
5 Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified contains annealing sequences for the 
oligonucleotide probes in the BeadArray. Upon purchase of the BeadArray service, each SNP and 
the primers used to analyze each SNP are identified. With this knowledge, the multiplex primers 
are designed to encompass annealing regions for the primers in the BeadArray. 

10 

Purification of Fragment of Interest 

The unused primers, and nucleotides are removed from the reaction by using Qiagen 
MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 ^il 
15 of sterile water. 

TaqMan Assay 

The amplified DNA is assayed with the TaqMan assay as described above. Each SNP is 
genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
20 and DNA isolated from the plasma is heterozygous are quantitated. 

Quantification 

The fluorescent intensity of the TaqMan allele specific probe is quantitated. As discussed 
above, the expected ratio of allele 1 to allele 2 is used to determine the presence or absence of a 
25 chromosomal abnormality. If the maternal genome is homozygous at SNP X (A/A), and the plasma 
DNA is heterozygous at SNP X (A/G), then the G represents the distinct fetal signal. 

The fluorescent intensity of the allele with the A nucleotide is quantitated and the intensity 
of the allele with the G nucleotide is quantitated. The ratio of G:A depends on the percentage of 
fetal DNA present in the maternal blood. 
30 For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 

G allele/(2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21 . 

However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 
35 for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
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condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

This example explained the use of the TaqMan assay, but it not intended to limit the use of 
5 other techniques that employ 5' nuclease activity. For example, the SYBR® Green I double 
stranded dye can also be used with the ABI PRISM 7700 Sequence Detection System and the 
GeneAmp® 5700 Sequence Detection system for determining the sequence of maternal and fetal 
DNA. The SYBR® Green I double stranded dye assay may be used to detect the presence of fetal 
chromosomal abnormalities in the maternal blood. 
1 0 SYBR® Green I double stranded dye is a highly specific double-stranded DNA binding 

dye that allows the detection of product accumulation during PCR. However, the SYBR® Green I 
double stranded dye assay detects all double stranded DNA including non-specific reaction 
products. The advantage of the SYBR® Green I double stranded dye assay is that it does not 
require a probe. 

15 The same primers design parameters are recommended for both the TaqMan Assay and the 

SYBR® Green I double stranded dye assay (see Primer Design section in Example 21). The primer 
optimization parameters recommended for the TaqMan assay should also be followed for the 
SYBR® Green I double stranded dye assay. In addition, no template controls should also be run 
with the various concentrations of primers. 

20 Furthermore, Applied Biosystems sell other products that may be used to determine the 

sequence of maternal and fetal DNA including but not limited to Assays-on-Demand™ SNP 
genotyping products, and Assasys-by-Design SM Service SNP genotyping products. 

Having now fully described the invention, it will be understood by those of skill in the art 
that the invention can be performed with a wide and equivalent range of conditions, parameters, and 

25 the like, without affecting the spirit or scope of the invention or any embodiment thereof. 

All documents, e.g., scientific publications, patents and patent publications recited herein 
are hereby incorporated by reference in their entirety to the same extent as if each individual 
document was specifically and individually indicated to be incorporated by reference in its entirety. 
Where the document cited only provides the first page of the document, the entire document is 

30 intended, including the remaining pages of the document. 

EXAMPLE 23 

Fetal chromosomal abnormalities are determined by analyzing SNPs wherein the maternal 
35 template DNA is homozygous and the template DNA obtained from the plasma is heterozygous. 

382 



WO 2004/078994 



PCT/US2004/006337 



Plasma that is isolated from blood of a pregnant female contains both maternal template DNA and 
fetal template DNA. Any number of SNP detection methods can be used to analyze the maternal 
and plasma DNA. In this example, SNPs are analyzed by ThirdWave Technologies' Invader™ 
Assay for Nucleic Acid Detection. However, other techniques that exploit and quantitate biological 
5 structures formed in the presence of the correct sequence can be used. 

Collection of Blood Samples 

In accordance with an IRB approved study, blood samples are collected from pregnant 
women after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes 

10 (catalog number NC9897284) and 0.225 ml of 10% neutral buffered solution containing 

formaldehyde (4% w/v), is added to each tube, and each tube gently is inverted. The tubes are 
stored at 4°C until ready for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 
tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 

15 glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 
crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO>3, BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DTME, DTSSP, EGS, 

20 HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXIII. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 
preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the maternal blood samples to 

25 reduce maternal cell lysis including but not limited to aldehydes, urea formaldehyde, phenol 
formaldehyde, DMAE (dimethylaminoethanol), cholesterol, cholesterol derivatives, high 
concentrations of magnesium, vitamin E, and vitamin E derivatives, calcium, calcium gluconate, 
taurine, niacin, hydroxylamine derivatives, bimoclomol, sucrose, astaxanthin, glucose, 
amitriptyline, isomer A hopane tetral phenylacetate, isomer B hopane tetral phenylacetate, 

30 citicoline, inositol, vitamin B, vitamin B complex, cholesterol hemisuccinate, sorbitol, calcium, 
coenzyme Q, ubiquinone, vitamin K, vitamin K complex, menaquinone, zonegran, zinc, ginkgo 
biloba extract, diphenylhydantoin, perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, 
PABA, disodium cromglycate, nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium 
hyaluronate, or polaxamer 188. 

35 
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Isolation of Plasma and Maternal Cells 

The blood is stored at 4°C until processing. The tubes are spun at 1000 rpra for ten minutes 
in a centrifuge with braking power set at zero. The tubes are spun a second time at 1000 rpm for 
ten minutes. The supernatant (the plasma) of each sample is transferred to a new tube and spun at 
5 3000 rpm for ten minutes with the brake set at zero. The supernatant is transferred to a new tube 
and stored at -80°C. Approximately two milliliters of the "buffy coat," which contains maternal 
cells, is placed into a separate tube and stored at -80°C. 

Isolation of DNA 

10 DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 

from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number51183). DNA is eluted in 100 fxl of distilled water. The Qiagen Midi Kit also is used to 
isolate DNA from the maternal cells contained in the "buffy coat." 

1 5 Identification of Homozygous Maternal SNPs 

Third Wave Technologies Invader™ Assay 

The Invader™ Assay, which was developed by Third Wave Technologies (Madison, WI), 
is an isothermal, "PCR-free" approach to the detection and quantitative analysis of DNA. The 

20 Invader Assay produces and amplifies an unrelated signal only in the presence of the correct target 
sequence. The Invader™ Assay relies on a thermostable member of the structure-specific 
archeabacterial flap endonuclease (FEN) family, which cleaves nucleic acid molecules at specific 
sites based on structure rather than sequence. When uses with structure forming probes for known 
sequences, the enzymes cleave in a structure and target sequence-specific manner. The nucleases 

25 used with Third Wave Technologies' assays are referred to as "Cleavase®" enzymes. 

The Invader™ Assay uses two target-specific oligonucleotides to create the substrate 
complex recognized by Cleavase Enzymes (L. DeFrancesco, The Scientist, 12(21):16, 1998). The 
substrate complex is formed when an upstream Invader oligo and a downstream signal probe 
hybridize in tandem to the nucleic acid. The 3* end of the Invader oligo must overlap the 

30 hybridization site of the signal probe by at least one base (Harrington et al. 9 Genes and 

Development, 8:1344-55, 1994). The 5' end of the signal probe has additional unpaired bases to 
form a 5' flap. Cleavase enzymes cleave the signal probe where it overlaps the Invader oligo, 
releasing the 5' arm. Reaction mixtures contain excess signal probe and are carried out near the 
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melting temperature of the probe. Many signal probes can be cleaved for each copy of the target 
without temperature cycling. 

The overlap between the Invader oligo and the signal probe is important. A mismatch 
positioned at the site of the overlap will block the cleavage by disrupting the overlap, which may 
5 allow discrimination of SNPs and mutations. 

The Invader assay utilizes two sequential cleavage steps. The 5' arm of the signal probe 
released in the first reaction is not detected directly. Rather, a secondary cleavage product is the 
actual source of the signal, detected by fluorescence resonance energy transfer (FRET). The 
primary cleavage product, which is the 5' arm released in the first reaction, is used as an Invader 
10 oligo that hybridizes to a supplied FRET probe in the secondary reaction. The FRET probe is 
labeled with two dyes: a donor fluorophore and a quenching acceptor fluorophore. When the 
nuclease cleaves the secondary probe, the two fluorophores are separated, quenching is eliminated, 
and the enhanced fluorescence signal from the donor dye is detected. 

15 Probe Design 

The following probes are designed to determine the sequence at SNP TSC1 172576 (T/A), 
which is located on chromosome 13: 
Invader Oligo for T allele: 

20 5' CATGCAGATATACCGCATAT 3' 

Invader Oligo for A allele: 

5' CATGCAGATATACCGCATAA 3 ' 

25 

Invader oligonucleotides are designed to be complementary to an 18-22 base region 
immediately upstream of the signal probe, with an additional one base at the 3* end that "invades" 
the region hybridized to the signal probe by one base. 

30 Signal probe for T allele: 

GGTAGCATCT CTCAGCACAAGAG 

Signal probe for A allele: 

35 
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GGTAGCATCA CTCAGCACAAGAG 

The signal probes are designed to contain a 3' region that is complementary to the target 
sequence and a non-complementary 5' arm (the underlined sequence above) that is used for 
5 detection. The signal probes are labeled on the 5' end with 6-carboxyfluorescein (TET), 
hexachloro-6-carboxyfluorescein (HEX), 6-carboxyfluorescein (FAM). However, the 5' end can be 
labeled with any chemical moiety including but not limited to radioisotope, fluorescent reporter 
molecule, chemiluminescent reporter molecule, antibody, antibody fragment, hapten, biotin, 
derivative of biotin, photobiotin, iminobiotin, digoxigenin, avidin, enzyme, acridinium, sugar, 
10 enzyme, apoenzyme, homopolymeric oligonucleotide, hormone, ferromagnetic moiety, 
paramagnetic moiety, diamagnetic moiety, phosphorescent moiety, luminescent moiety, 
electrochemiluminescent moiety, chromatic moiety, moiety having a detectable electron spin 
resonance, electrical capacitance, dielectric constant or electrical conductivity, and combinations 
thereof 

15 Both invader and signal probes are complementary to either the sense or the antisense target 

DNA strand, depending on which results in the formation of the least number of predictable 
secondary structures. 

PGR Amplification 

20 A fragment of DNA that surrounds SNP TSC1 172576 is amplified by PCR. The sequence 

of the upstream and downstream primers is provided below: 

Upstream Primer: 

5' TAGCAGAATCTCTCAT 3 ' 

25 

Downstream Primer: 

5' AGAGTATCTCATTTGTT 3' 

Amplification reactions are performed in a final volume of 100 jul of containing 2 |il of 
30 genomic DNA, 35 pmol of each primer, 50 nm of each deoxynucleotide (Perkin-Elmer Applied 
Biosystems, Inc., Foster City, CA), 1 XPCR buffer (20 mM Tris-Hcl, SOmMKCl, 1.5mMMgCl 2 , 
0.05%Tween-20, 0.05% NP40), 1 M betaine, 5% dimethylsulfoxide (DMSO), and 2.5 U of Taq 
polymerase (Roche Boehringer Mannheim, Indianapolis, IN). PCR cycling conditions consist of an 
initial denaturation step at 95°C for 5 min, 30 cycles of denaturation at 95°C for 1 min, annealing at 
35 68°C for 1 min, and extension at 72°C for 1 min, and a final extension at 72°C for 5 min. 
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Invader Reaction 

One microliter of each PCR product is added to 0.5 pmol of the appropriate Invader 
oligonucleotide, 10 ng human genomic DNA (Promega Corp., Madison, WI) as the carrier, and 
5 mopholinepropanesulfonic acid (MOPS) buffer (pH 8.0) at a final concentration of 10 raM in a 
volume of 7 \xl. The mixtures are denatured for 5 min at 95°C and then cooled to reaction 
temperature of 60°C. Invader reactions are inititated by the addition of a mixture containing 30 ng 
of Cleavase Vm (Third Wave Technologies, Inc., Madison, WI) 25 mM MgCl 2 , and 10 pmol of the 
appropriate signal probe oligonucleotide in a volume of 3 fxl. Reaction mixtures are incubated for 

10 60 min. The reactions are terminated by the addition of 10 jxl of 95% formamide -10 mM EDTA 
(pH 8.0)- 0.05% crystal violet. Following termination, the reactions are diluted 1:10 in reagent- 
grade water. Samples of 2 |xl are loaded and electrophoresed in a 24% denaturing polyacrylamide 
gel (18 cm by 25.5 cm by mm) on an automated fluorescence sequencing apparatus (model 377, 
PE-ABI). The data are collected using filter set C and processed with GeneScan software. 

1 5 In addition, 5 julI of each sample is electrophoresed in 20% (acrylamide to bisacrylamide, 

19:1) denaturing polyacrylamide gels at 20 W. Gel cassettes (20 cm by 20 cm by 0.5 mM) are 
scanned with a fluorescent scanner (FMBIO-100; Hitachi Corp, San Bruno, CA) by using a 585 nm 
filter for TET and HEX labeled probes and a 505 nm filter for FAM labeled probes. 

With the Invader assay, it is also possible to perform a second cleavage reaction where the 

20 released 5' arm of the signal probe is hybridized to another probe, and fluorescence resonance 

energy transfer (FRET) is used to detect the presence of a specific nucleic acid. The manufacturer's 
protocols are followed when using the FRET probe. 

The genotype at each SNP is determined by analyzing the fluorescence intensity of each 
allele-specific signal probe. For example, for SNP TSC1 172576, the presence of the T allele is 

25 determined by analyzing the amount of released 5' signal probe using the signal probe from the T 
allele signal probe (as described above). Likewise, the presence of the A allele is determined by 
analyzing the amount of released 5'signal probe from the A allele signal probe. The reactions can 
be performed in a single reaction vessel using two different chemical moieties, which can be 
analyzed under distinct conditions, or the A and T allele reactions can be performed in two different 

30 reaction vessels. 



Analysis of DNA Isolated from Maternal Plasma 

After the maternal DNA is analyzed and homozygous SNPs are identified, these SNPs are 
analyzed with the DNA isolated from the plasma. A low copy number of fetal genomes typically 
35 exist in the maternal plasma. To increase the copy number of the loci of interest, which are the 
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SNPs at which the maternal DNA is homozygous, primers are designed to anneal at approximately 
130 bases upstream and 1 30 bases downstream of each loci of interest. This is done to reduce 
statistical sampling error that can occur when working with a low number of genomes, which can 
influence the ratio of one allele to another (see Example 1 1). 

5 

Design of Multiplex Primers 

The primers are 12 bases in length. However, primers of any length can be used including 
but not limited to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36-45, 46-55, 56-65, 66-75, 76-85, 86-95, 96-105, 106-115, 
10 1 16-125, and greater than 125 bases. Primers are designed to anneal to both the sense strand and 
the antisense strand. 

The maternal homozygous SNPs vary from sample to sample so defined sequences are not 
provided here. Primers are designed to anneal about 130 bases upstream and downstream of the 
maternal homozygous SNPs. The primers are designed to terminate at the 3' end in the dinucleotide 
15 "AA" to reduce the formation of primer-dimers. However, the primers can be designed to end in 
any of the four nucleotides and in any combination of the four nucleotides. 

Multiplex PCR 

Regions upstream and downstream of the maternal homozygous SNPs are amplified from 
20 the template genomic DNA using the polymerase chain reaction (PCR, U.S. Patent Nos. 4,683,195 
and 4,683,202, incorporated herein by reference). This PCR reaction uses primers that anneal 
approximately 130 bases upstream and downstream of each loci of interest. The primers are mixed 
together and are used in a single reaction to amplify the template DNA. This reaction is done to 
increase the number of copies of the loci of interest, which eliminates error generated from a low 
25 number of genomes. 

For increased specificity, a "hot-start" PCR reaction is used. PCR reactions are performed 
using the HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount 
of template DNA and primer per reaction is optimized for each locus of interest In this example, 
the 20 jil of plasma template DNA is used. 
30 Two microliters of each forward and reverse primer, at concentrations of 5 mM are pooled 

into a single microcentrifuge tube and mixed. Four microliters of the primer mix is used in a total 
PCR reaction volume of 50 [d (20jxl of template plasma DNA, 1 \il of sterile water, 4 \xl of primer 
mix, and 25 pi of HotStar Taq. Twenty-five cycles of PCR are performed. The following PCR 
. conditions are used: 
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(1) 95°C for 15 minutes; 

(2) 95°Cfor30 second; 

(3) 4°C for 30 seconds; 

(4) 37°C for 30 seconds; 

5 (5) Repeat steps 2-4 twenty-four (24) times; 

(6) 72°Cfor 10 minutes. 
The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 

Other methods of genomic amplification can also be used to increase the copy number of 
10 the loci of interest including but not limited to primer extension preamplification (PEP) (Zhang et 
aL, PNAS, 89:5847-51, 1992), degenerate oligonucleotide primed PCR (DOP-PCR) (Telenius, et 
aL, Genomics 13:718-25, 1992), strand displacement amplification using DNA polymerase from 
bacteriophage 29, which undergoes rolling circle replication (Dean et aL, Genomic Research 
11:1095-99, 2001), multiple displacement amplification (U.S. Patent 6,124,120), REPLI-g™ Whole 
1 5 Genome Amplification kits, and Tagged PCR. 

It is important to ensure that the region amplified contains annealing sequences for the 
oligonucleotide probes in the BeadArray. Upon purchase of the BeadArray service, each SNP and 
the primers used to analyze each SNP are identified. With this knowledge, the multiplex primers 
are designed to encompass annealing regions for the primers in the BeadArray. 

20 

Purification of Fragment of Interest 

The unused primers, and nucleotides are removed from the reaction by using Qiagen 
MinElute PCR purification kits (Qiagen, Catalog Number 28004). The reactions are performed 
following the manufacturer's instructions supplied with the columns. The DNA is eluted in 100 
25 of sterile water. 

r 

Invader Assay 

The amplified DNA is assayed with the Invader assay as described above. Each SNP is 
genotyped. SNPs located on chromosomes 13 and 21, wherein the maternal DNA is homozygous, 
30 and DNA isolated from the plasma is hetero2ygous are quantitated. 

Quantification 

The fluorescent intensity of the allele specific signal probe is quantitated. As discussed 
above, the expected ratio of allele 1 to allele 2 is used to determine the presence or absence of a 
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chromosomal abnormality. If the maternal genome is homozygous at SNP X (A/A), and the plasma 
DNA is heterozygous at SNP X (A/G), then the G represents the distinct fetkl signal. 

The fluorescent intensity of the allele with the A nucleotide is quantitated and the intensity 
of the allele with the G nucleotide is quantitated. The ratio of G: A depends on the percentage of 
5 fetal DNA present in the maternal blood. 

For example, if the sample contains 50% fetal DNA, then the expected ratio is 0.33 (1 fetal 
G allele/ (2 maternal A alleles + 1 fetal A allele)). This ratio should be constant for all 
chromosomes that are present in two copies. The ratio that is obtained for SNPs on chromosome 13 
should be the same as the ratio that is obtained for chromosome 21. 
10 However, if the fetal genome contains an additional copy of chromosome 21, then the ratio 

for this chromosome will deviate from the expected ratio. The expected ratio for a Trisomy 
condition with 50% fetal DNA in the maternal blood is 0.25. Thus, by analyzing SNPs wherein the 
maternal genome is homozygous, and the DNA that is isolated from the plasma is heterozygous, 
fetal chromosomal abnormalities can be detected. 

15 

EXAMPLE 24 

Tissue and cell injury occur under both normal and pathological conditions. Therefore, it is 
expected that intracellular material, including DNA, may be released into the circulation. Indeed, 
free DNA has been detected in serum and other fluids (Hughes et al., Arthritis and Rheumatology, 

20 14:259-266, 1971). In addition, increased quantities of free DNA have been found in patients with 
chronic autoimmune disorders and cancer (Leon et al, Cancer Research; 37:646-650, 1977). 
However, quantitative assessments of plasma DNA are neither sensitive nor specific enough for 
diagnostic use, since levels overlap considerably between cancer patients and normal subjects. In 
addition, a large percentage of the free DNA is normal DNA, which makes it very difficult to detect 

25 the presence of the mutant DNA. 

As discussed in Examples 4 and 15 above, the use of cell lysis inhibitors, cell membrane 
stabilizers, or cross-linking reagents can be used to increase the percentage of free fetal DNA in the 
maternal blood. In this example, these methods are used to increase the percentage of mutant DNA 
compared to normal DNA in a plasma sample, which is obtained from a blood sample that is treated 

30 with formaldehyde. However, these methods can be extended to increase the percentage of mutant 
DNA compared to normal DNA in any sample. This example describes the detection of mutant 
DNA at codon 1370 of the APC gene. 

Collection of Blood Samples 
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In accordance with an IRB approved study, blood samples are collected from individuals 
after informed consent is granted. Blood is collected into 9 ml EDTA Vacuette tubes (catalog 
number NC9897284) and 0.225 ml of 10% neutral buffered solution containing fonnaldehyde (4% 
w/v), is added to each tube, and each tube gently is inverted. The tubes are stored at 4°C until ready 
5 for processing. 

Any number of agents that impede cell lysis or stabilize cell membranes can be added to the 
tubes including but not limited to formaldehyde, and derivatives of formaldehyde, formalin, 
glutaraldehyde, and derivatives of glutaraldehyde, crosslinkers, primary amine reactive crosslinkers, 
sulfhydryl reactive crosslinkers, sulfhydryl addition or disulfide reduction, carbohydrate reactive 

10 crosslinkers, carboxyl reactive crosslinkers, photoreactive crosslinkers, cleavable crosslinkers, 
AEDP, APG, BASED, BM(PEO) 3 , BM(PEO) 4 , BMB, BMDB, BMH, BMOE, BS3, BSOCOES, 
DFDNB, DMA, DMP, DMS, DPDPB, DSG, DSP, DSS, DST, DTBP, DIME, DTSSP, EGS, 
HBVS, sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS or compounds listed in Table XXIH. Any 
concentration of agent that stabilizes cell membranes or impedes cell lysis can be added. In a 

15 preferred embodiment, the agent that stabilizes cell membranes or impedes cell lysis is added at a 
concentration that does not impede or hinder subsequent reactions. 

An agent that stabilizes cell membranes may be added to the samples to reduce cell lysis 
including but not limited to aldehydes, urea formaldehyde, phenol formaldehyde, DMAE 
(dimethylaminoethanol), cholesterol, cholesterol derivatives, high concentrations of magnesium, 

20 vitamin E, and vitamin E derivatives, calcium, calcium gluconate, taurine, niacin, hydroxylamine 
derivatives, bimoclomol, sucrose, astaxanthin, glucose, amitriptyline, isomer A hopane tetral 
phenylacetate, isomer B hopane tetral phenylacetate, citicoline, inositol, vitamin B, vitamin B 
complex, cholesterol hemisuccinate, sorbitol, calcium, coenzyme Q, ubiquinone, vitamin K, 
vitamin K complex, menaquinone, zonegran, zinc, ginkgo biloba extract, diphenylhydantoin, 

25 perftoran, polyvinylpyrrolidone, phosphatidylserine, tegretol, PABA, disodjum cromglycate, 
nedocromil sodium, phenytoin, zinc citrate, mexitil, dilantin, sodium hyaluronate, or polaxamer 
188. 

Blood is collected by any method or process that results in a substantial increase in the ratio 
of mutant DNA/normal DNA in the resulting serum or plasma after appropriate processing. As 
30 used herein, a substantial increase in the ratio of mutant DNA/normal DNA is that which can be 
detected by the methods as described herein. Such methods or processes typically result in a 
substantial increase in the ratio of mutant DNA/normal DNA of about 5%, 10%, 15%, 20%, 30%, 
50%, 70%, 80%, 100% or more of the ratio of mutant DNA/normal DNA found in blood samples 
collected by standard procedures. 
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In other embodiments, blood is collected by any method or process that results in a 
substantial increase in the amount of free DNA compared to the amount of total DNA recovered or 
detected in the resulting serum or plasma after processing. Such methods or processes typically 
result in a substantial increase so the free DNA recovered or detected is about 10%, 15%, 20%, 
5 25%, 30%, 40%, 50% or more of the total DNA recovered or detected in the processed plasma or 
serum sample. 

The methods or processes of collecting blood samples may also include other steps that 
result in lessened or reduced cell lysis. For instance, blood collection devices may be modified to 
decrease cell lysis due to sheer forces in the collection needle, syringe or tubes used. For instance, 
1 0 needles of large gauge may be employed to reduce cell sheering or vacutainer tubes may be 
modified to reduce the velocity of blood flow. 

Isolation of Plasma 

Any method may be used to isolate plasma from the cell components of blood after 
15 collection but methods wherein cell lysis is substantially prevented, reduced or inhibited are 
preferred. The blood is stored at 4°C until processing. Methods for isolation of the plasma are 
implemented to reduce the amount of maternal cell lysis. The tubes are spun at 1000 rpm for ten 
minutes in a centrifuge with braking power and acceleration power set at zero to substantially 
prevent, reduce or inhibit cell lysis and or mixing of blood cell components into the plasma. The 
20 tubes are spun a second time at 1000 rpm for ten minutes with braking power (centrifuge stopped 
by natural deceleration) and acceleration power set to zero. The supernatant (the plasma) of each 
sample is transferred carefully to a new tube and spun at 3000 rpm for ten minutes with the brake 
and acceleration power set at zero. The supernatant (the plasma) of each sample is collected via 
procedures to substantially prevent mixing of cell components into the plasma. Great care is taken 
25 to ensure that the buffy-coat is not disturbed. A percentage of the supernatant can be left in the tube 
including but not limited to 0.001-1%, 1-10%, 10-20%, 20-30%, 30-40%, 40-50%, 50-60%, 60- 
70%, 70-80% or greater than 80%. The supernatant is transferred to a new tube and stored at - 
80°C. 

30 Isolation of DNA 

DNA is isolated from the plasma sample using the Qiagen Midi Kit for purification of DNA 
from blood cells, following the manufacturer's instructions (QIAmp DNA Blood Midi Kit, Catalog 
number 51183). DNA is eluted in 100^1 of distilled water. However, any method of DNA 
isolation can be used including cesium chloride gradients, gradients, sucrose gradients, glucose 
35 gradients, centrifixgation protocols, boiling, Qiagen purification systems, QIA DNA blood 
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purification kit, HiSpeed Plasmid Maxi Kit, QIAfilter plasmid kit, Promega DNA purification 
systems, MangeSil Paramagnetic Particle based systems, Wizard SV technology, Wizard Genomic 
DNA purification kit, Amersham purification systems, GFX Genomic Blood DNA purification kit, 
Invitrogen Life Technologies Purification Systems, CONCERT purification system, Mo Bio 
5 Laboratories purification systems, UltraClean BloodSpin Kits, and UlraClean Blood DNA Kit. The 
skilled artisan understands that the manufacturer's protocols can be modified to increase the yield 
of DNA. For example, the Qiagen Midi Kit for purification of DNA recommends the use of IX AL 
buffer. However, any concentration of AL buffer may be used if the yield of DNA increases 
including but not limited to 0.1-0.5X AL buffer, 0.5-1X AL buffer, 1X-2X AL buffer, 2-3X AL 
1 0 buffer, 3-4X AL buffer, 4-5X AL buffer, and greater than 5X AL buffer. The skilled artisan 

understands that the modifications and manipulations of the reagents are not limited to AL buffer. 

Detection of Mutant DNA 

15 Design of Primers 

To determine if a mutation resides at codon 1370 of the APC gene, the following primers 
are used: 

First primer: 

5' GTGCAAAGGCCTGAATTCCCAGGCACAAAGCTGTTGAA 3' 

20 

Second primer: 

5' TGAAGCGAACTAGGGACTCAGGTGGACTT 

The first primer contains a biotin tag at the extreme 5' end, and the nucleotide sequence for 
25 the restriction enzyme EcoRI. The second primer contains the nucleotide sequence for the 
restriction enzyme BsmF I. 

PCR Reaction 

The loci of interest are amplified from the template genomic DNA using the polymerase 
30 chain reaction (PCR, U.S. Patent Nos. 4,683,195 and 4,683,202, incorporated herein by reference). 
The loci of interest are amplified in separate reaction tubes; they can also be amplified together in a 
single PCR reaction. For increased specificity, a "hot-start" PCR reaction is used, e.g. by using the 
HotStarTaq Master Mix Kit supplied by QIAGEN (catalog number 203443). The amount of 
template DNA and primer per reaction are optimized for each locus of interest but in this example, 
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40 ng of template human genomic DNA and 5 |iM of each primer are used. Forty cycles of PCR 
are performed. The following PCR conditions are used: 

(1) 95°C for 15 minutes and 15 seconds; 
5 (2) 37°C for 30 seconds; 

(3) 95°Cfor30 seconds; 

(4) 57°C for 30 seconds; 

(5) 95°Cfor30 seconds; 

(6) 64°Cfor 30 seconds; 
10 (7) 95°C for 30 seconds; 

(8) Repeat steps 6 and 7 thirty nine (39) times; 

(9) 72°C for 5 minutes. 

In the first cycle of PCR, the annealing temperature is about the melting temperature of the 
15 3 ' annealing region of the second primers, which is 37°C. The annealing temperature in the second 
cycle of PCR is about the melting temperature of the 3* region, which anneals to the template DNA, 
of the first primer, which is 57°C. The annealing temperature in the third cycle of PCR is about the 
melting temperature of the entire sequence of the second primer, which is 64°C. The annealing 
temperature for the remaining cycles is 64°C. Escalating the annealing temperature from TM1 to 
20 TM2 to TM3 in the first three cycles of PCR greatly improves specificity. These annealing 

temperatures are representative, and the skilled artisan understands that the annealing temperatures 
for each cycle are dependent on the specific primers used. 

The temperatures and times for denaturing, annealing, and extension, are optimized by 
trying various settings and using the parameters that yield the best results. 

25 

Purification of Fragment of Interest 

The PCR products are separated from the genomic template DNA. Each PCR product is 
divided into four separate reaction wells of a Streptawell, transparent, High-Bind plate from Roche 
Diagnostics GmbH (catalog number 1 645 692, as listed in Roche Molecular Biochemicals, 2001 

30 Biochemicals Catalog). The first primers contain a 5 f biotin tag so the PCR products bound to the 
Streptavidin coated wells while the genomic template DNA does not. The streptavidin binding 
reaction is performed using a Thermomixer (Eppendorf) at 1000 rpm for 20 min. at 37°C. Each 
well is aspirated to remove unbound material, and washed three times with IX PBS, with gentle 
mixing (Kandpal et al., Nucl. Acids Res. 18:1789-1795 (1990); Kaneoka et al., Biotechniques 

35 10:30-34 (1991); Green et al., Nucl. Acids Res. 18:6163-6164 (1990)). 
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Alternatively, the PCR products are placed into a single well of a streptavidin plate to 
perform the nucleotide incorporation reaction in a single well. 

Restriction Enzyme Digestion of Isolated Fragments 

5 The purified PCR products are digested with the restriction enzyme BsmF I (New England 

Biolabs catalog number R0572S), which binds to the recognition site incorporated into the PCR 
products from the second primer. The digests are performed in the Streptawells following the 
instructions supplied with the restriction enzyme. After digestion with the appropriate restriction 
enzyme, the wells are washed three times with PBS to remove the cleaved fragments. 

10 

Incorporation of Labeled Nucleotide 

The restriction enzyme digest described above yields a DNA fragment with a 5* overhang, 
which contains the locus of interest and a 3* recessed end. The 5' overhang functions as a template 
allowing incorporation of a nucleotide or nucleotides in the presence of a DNA polymerase. 

15 For each locus of interest, four separate fill in reactions are performed; each of the four 

reactions contains a different fluorescently labeled ddNTP (ddATP, ddTTP, ddGTP, or ddCTP). 
The following components are added to each fill in reaction: 1 \il of a fluorescently labeled ddNTP, 
0.5 ]i\ of unlabeled ddNTPs ( 40 jiM), which contains all nucleotides except the nucleotide that is 
fluorescently labeled, 2 ^1 of 10X sequenase buffer, 0.25 fil of Sequenase, and water as needed for a 

20 20|nl reaction. The fill are performed in reactions at 40°C for 10 min. Non-fluorescently labeled 
ddNTP are purchased from Fermentas Inc. (Hanover, MD). All other labeling reagents are obtained 
from Amersham (Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565). In 
the presence of fluorescently labeled ddNTPs, the 3' recessed end is extended by one base, which 
corresponds to the locus of interest. 

25 A mixture of labeled ddNTPs and unlabeled dNTPs also can be used for the fill-in reaction. 

The "fill in" conditions are as described above except that a mixture containing 40 jiM unlabeled 
dNTPs, 1 (il fluorescently labeled ddATP, 1 ^1 fluorescently labeled ddTTP, 1 pi fluorescently 
labeled ddCTP, and 1 \il ddGTP are used. The fluorescent ddNTPs are obtained from Amersham 
(Thermo Sequenase Dye Terminator Cycle Sequencing Core Kit, US 79565; Amersham does not 

30 publish the concentrations of the fluorescent nucleotides). The locus of interest is digested with the 
restriction enzyme BsmF I, which generates a 5' overhang of four bases. If the first nucleotide 
incorporated is a labeled ddNTP, the 3' recessed end is filled in by one base, allowing detection of 
the locus of interest. However, if the first nucleotide incorporated is a dNTP, the polymerase 
continues to incorporate nucleotides until a ddNTP is filled in. For example, the first two 

35 nucleotides may be filled in with dNTPs, and the third nucleotide with a ddNTP, allowing detection 
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of the third nucleotide in the overhang. Thus, the sequence of the entire 5 ! overhang is determined, 
which increases the information obtained from each SNP or locus of interest. This type of fill in 
reaction is especially useful when detecting the presence of insertions, deletions, insertions and 
deletions, rearrangements, and translocations. 
5 Alternatively, one nucleotide labeled with a single dye is used to determine the sequence of 

the locus of interest. See Example 6. This method eliminates any potential errors when using 
different dyes, which have different quantum coefficients. 

After labeling, each Streptawell is rinsed with IX PBS (100 three times. The "filled in" 
DNA fragments are released from the Streptawells by digesting with the restriction enzyme EcoRI, 
1 0 according to the manufacturer's instructions that are supplied with the enzyme. The digestion is 
performed for 1 hour at 37 °C with shaking at 120 rpm. 

Detection of the Locus of Interest 

After release from the streptavidin matrix, the sample is loaded into a lane of a 36 cm 5% 
1 5 aciylamide (urea) gel (BioWhittaker Molecular Applications, Long Ranger Run Gel Packs, catalog 
number 50691). The sample is electrophoresed into the gel at 3000 volts for 3 min. The gel is run 
for 3 hours using a sequencing apparatus (Hoefer SQ3 Sequencer). The incorporated labeled 
nucleotide is detected by fluorescence. 

To determine if any cells contain mutations at codon 1370 of the APC gene when separate 
20 fill-in reactions are performed, the lanes of the gel that correspond to the fill-in reaction for ddATP 
and ddTTP are analyzed. If only normal cells are present, the lane corresponding to the fill in 
reaction with ddATP is a bright signal. No signal is detected for the "fill-in" reaction with ddTTP. 
However, if the patient sample contains cells with mutations at codon 1370 of the APC gene, the 
lane corresponding to the fill in reaction with ddATP is a bright signal, and a signal is detected from 
25 the lane corresponding to the fill in reaction with ddTTP. The intensity of the signal from the lane 
corresponding to the fill in reaction with ddTTP is indicative of the number of mutant cells in the 
sample. 

Alternatively, one labeled nucleotide is used to determine the sequence of the alleles at 
codon 1370 of the APC gene. At codon 1370, the normal sequence is AAA, which codes for the 
30 amino acid lysine. However, a nucleotide substitution has been identified at codon 1370, which is 
associated with colorectal tumors. Specifically, a change from A to T (AAA-TAA) typically is 
found at codon 1370, which results in a stop codon. A single fill-in reaction is performed using 
labeled ddATP, and unlabeled dTTP, dCTP, and dGTP. A single nucleotide labeled with one 
fluorescent dye is used to determine the presence of both the normal and mutant DNA sequence that 
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codes for codon 1370. The relevant DNA sequence is depicted below with the sequence 
corresponding to codon 1370 in bold: 

5' CCCAAAAGTCCACCTGA 
5 3' GGGTTTTCAGGTGGACT 

After digest with BsmF I, the following overhang is produced: 

5' CCC 

10 3* GGG T T T T 

Overhang position 12 3 4 

If the patient sample has no cells harboring a mutation at codon 1370, one signal is 
seen corresponding to incorporation of labeled ddATP. 

15 

5' CCC A* 

3 J GGG T T T T 

Overhang position 12 3 4 

20 However, if the patient sample has cells with mutations at codon 1370 of the APC 

gene, one signal is seen, which corresponds to the normal sequence at codon 1370, and a second 
signal is seen, which corresponds to the mutant sequence at codon 1370. The signals clearly are 
identified as they differ in molecular weight. 

25 Overhang of normal DNA sequence: CCC 

GGG T T T T 
Overhang position 12 3 4 

Normal DNA sequence after fill-in: CCC A* 
30 GGG T T T T 

Overhang position 12 3 4 

Overhang of mutant DNA sequence: CCC 

GGG A T T T 
35 Overhang position 1 2 3' 4 
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Mutant DNA sequence after fill-in: CCC T A* 

GGG A T T T 
Overhang position 12 3 4 

5 

Two signals are seen when the mutant allele is present The mutant DNA molecules are 
filled in one base after the wild type DNA molecules. The two signals are separated using any 
method that discriminates based on molecular weight. One labeled nucleotide (ddATP) is used to 
detect the presence of both the wild type DNA sequence and the mutant DNA sequence. This 

10 method of labeling reduces the number of reactions that need to be performed and allows accurate 
quantitation for the number of mutant cells in the patient sample. The number of mutant cells in the 
sample is used to determine patient prognosis, the degree and the severity of the disease. This 
method of labeling eliminates the complications associated with using different dyes, which have 
distinct quantum coefficients. This method of labeling also eliminates errors associated with 

1 5 pipetting reactions. 

Any method for determining the sequence of alleles of a locus of interest can be used 
including but not limited to allele specific PCR, gel electrophoresis, ELISA, mass spectrometry, 
hybridization, primer extension, fluorescence polarization, fluorescence detection, fluorescence 
resonance energy transfer (FRET), sequencing, DNA microarray, SNP-IT, GeneChips, HuSNP, 

20 BeadArray, TaqMan assay, Invader assay, MassExtend, MassCleave™ (hMC) method, southern 
blot, slot blot, dot blot, and MALDI-TOF mass spectrometry. 

In the above example, methods for the detection of a nucleotide substitution are described. 
However, the methods can be used for the detection of any type of mutation including but not 
limited to nucleotide substitutions (see Table VII), splicing errors (see Table VDI), small deletions 

25 (see Table IX), small insertions (see Table X), small insertions/deletions (see Table XI), gross 

deletions (see Table XH), gross insertions (see Table XIII), and complex rearrangements (see Table 
XIV). 

i 

In addition, the above-described methods are used for the detection of any type of disease 
including but not limited to those listed in Table IV. Furthermore, any type of mutant gene is 

30 detected using the inventions described herein including but not limited to the genes associated with 
the diseases listed in Table IV, BRCA1, BRCA2, MSH6, MSH2, MLH1, RET, PTEN, ATM, H- 
RAS, p53, ELAC2, CDH1, APC, AR, PMS2, MLH3, CYP1A1, GSTP1, GSTM1, AXIN2, CYP19, 
MET, NAT1, CDKN2A, NQ01, trc8, RAD51, PMS1, TGFBR2, VHL, MC4R, POMC, NROB2, 
UCP2, PCSK1, PPARG, ADRB2, UCP3, glurl, cart, SORBS1, LEP, LEPR, SIM1, TNF, IL-6, BL- 

35 1, EL-2, IL-3, EL1A, TAP2, THPO, THRB, NBS1, RBM15, UF, MPL, RUNX1, Her-2, 
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glucocorticoid receptor, estrogen receptor, thyroid receptor, p21, p27, K-RAS, N-RAS, 
retinoblastoma protein, Wiskott-Aldrich (WAS) gene, Factor V Leiden, Factor II (prothrombin), 
methylene tetrahydrofolate reductase, cystic fibrosis, LDL receptor, HDL receptor, superoxide 
dismutase gene, SHOX gene, genes involved in nitric oxide regulation, genes involved in cell cycle 
5 regulation, tumor suppressor genes, oncogenes, genes associated with neurodegeneration, genes 
associated with obesity. Abbreviations correspond to the proteins as listed on the Human Gene 
Mutation Database, which is incorporated herein by reference www.archive.uwcm.ac.uk./uwcm; 
website address active as of February 12, 2003). 

The above-example demonstrates methods to increase the ratio of mutant allele to normal 

10 allele in a plasma sample. However, the methods described herein are used for detection of mutant 
cells from any biological sample including but not limited to blood sample, serum sample, plasma 
sample, urine sample, spinal fluid, lymphatic fluid, semen, vaginal secretion, ascitic fluid, saliva, 
mucosa secretion, peritoneal fluid, fecal sample, body exudates, breast fluid, lung aspirates, cells, 
tissues, individual cells or extracts of the such sources that contain the nucleic acid of the same, and 

15 subcellular structures such as mitochondria or chloroplasts. In addition, the methods described 
herein are used for the detection of mutant cells and mutated DNA from any number of nucleic acid 
containing sources including but not limited to forensic, food, archeological, agricultural or 
inorganic samples. 

The above example is directed to detection of mutations in the APC gene. By increasing 
20 the ratio of mutant APC allele to normal allele in plasma, the detection of the mutant APC allele is 
more efficient and more accurate. These methods can be applied to the detection of mutations in 
any gene including but not limited to genes associated with or that predispose to disease (see Table 
XV). In addition, these methods can be used to detect disease state, monitor disease progression 
and response to therapy. For example, the detection of the mutant allele in a plasma sample 
25 provides a non-invasive method for monitoring cancer recurrence. Detection of the mutant allele, 
which indicates the presence of the tumor cells, suggests that new or more aggressive treatments 
should be prescribed. 

The present methods can also be used to monitor and detect microsatellite alterations in 
plasma DNA of small cell lung cancer patients. Microsatellite instability is an important 

30 characteristic of numerous tumor types, particularly those associated with hereditary non-polyposis 
colorectal carcinoma (HNPCC) syndrome (Chen et al., Nature Medicine, Vol 2:1033-1035). The 
methods described in this example also can be applied to the detection of microsatellite DNA 
alterations in serum DNA of head and neck cancer patients (Nawroz et al., Nature Medicine, Vol. 
2:1035-1037. By increasing the ratio of the abnormal DNA to the normal DNA, detection will be 

35 improved, thereby allowing for earlier diagnosis and increased chance of recovery. 
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Furthermore, these methods can be applied to the detection of an infectious disease agent 
including but not limited to bacteria, viruses, fungi, mycobacteria, protozoa, molds, multi-cellular 
parasite or yeast. For instance, it is important to monitor for the presence of bacterial or viral DNA 
that contain a mutation, which makes it resistant to treatment. By increasing the ratio of the mutant 
5 bacterial or viral allele to the normal allele, the mutant allele can be detected earlier, thereby 
allowing a new course of treatment to be prescribed. 

This is particularly important for monitoring the state of HIV infection. Most HTV strains 
initially are sensitive to protease inhibitors and other drugs. However, over time, drug-resistant 
strains develop. The methods described herein can be used for early detection of the drug-resistant 
10 strains, which will allow treatment options to be reassessed before the drug-resistant strain gains 
dominance. 
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WHAT IS CLAIMED IS: 



1 . A method for detecting a nucleic acid, wherein said method comprises: (a) isolating 
5 nucleic acid from a nucleic acid containing sample, wherein an agent that impedes cell lysis 

was added to the sample; and (b) detecting the presence or absence of the nucleic acid. 

2. The method of claim 1, wherein said sample is obtained from a source selected 
from the group consisting of bacteria, viruses, fungi, mycobacteria, protozoa, molds, yeasts, 

10 plants, humans, non-humans, multi-cellular parasite, animals, and archeabacteria. 

3 . The method of claim 2, wherein the sample is obtained from a human source. 

4. The method of claim 1, wherein the sample is obtained from a source selected from 
15 the group consisting of: a cell, fetal cell, tissue, blood, serum, plasma, saliva, urine, tear, 

vaginal secretion, umbilical cord blood, chorionic villi, amniotic fluid, embryonic tissue, 
lymph fluid, cerebrospinal fluid, mucosa secretion, peritoneal fluid, ascitic fluid, fecal 
matter, or body exudates. 

20 5 . The method of claim 4, wherein said sample is blood. 

6. The method of claim 5, wherein said sample is obtained from plasma from said 
blood. 

25 7. The method of claim 1, wherein said nucleic acid contains at least one mutation. 

8. The method of claim 7, wherein said mutation is selected from the group consisting 
of: single point mutation, multiple point mutations, an insertion, a frameshift, a truncation, 
a deletion, a duplication, and a transversion. 

30 

9. The method of claim 7, wherein said mutation is in a gene selected from the group 
consisting of: BRCA1, BRCA2, MSH6, MSH2, MLH1, RET, PTEN, ATM, H-RAS, p53, 
ELAC2, CDH1, APC, AR, PMS2, MLH3, CYP1A1, GSTP1, GSTM1, AXJN2, CYP19, 
MET, NAT1, CDKN2A, NQ01, trc8, RAD51, PMS1, TGFBR2, VHL, MC4R, POMC, 
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NROB2, UCP2, PCSK1, PPARG, ADRB2, UCP3, glurl, cart, S0RBS1, LEP, LEPR, 
SIM1, TNF, LL-6, IL-1, IL-2, IL-3, ILIA, TAP2, THPO, THRB, NBS1, RBM15, LIF, 
MPL, RUNX1, Her-2, glucocorticoid receptor, estrogen receptor, thyroid receptor, p21, 
p27, K-RAS, N-RAS, retinoblastoma protein, Wiskott-Aldrich (WAS) gene, Factor V 
5 Leiden, Factor II (prothrombin), methylene tetrahydrofolate reductase, cystic fibrosis, LDL 

receptor, HDL receptor, superoxide dismutase gene, SHOX gene, genes involved in nitric 
oxide regulation, genes involved in cell cycle regulation, tumor suppressor genes, 
oncogenes, genes associated with neurodegeneration, and genes associated with obesity 

10 10. The method of claim 1, wherein said nucleic acid is from a species different from 

the species from which the sample was taken. 

1 1. The method of claim 10, wherein said different species is from a group consisting 
of bacteria, viruses, fungi, mycobacteria, protozoa, molds, yeasts, plants, humans, non- 

i5 humans, multi-cellular parasite, animals, and archeabacteria. 

12. The method of claim 1 1, wherein said different species is a bacteria. 

13. The method of claim 1 2, wherein said bacterial species is a gram-positive or gram- 
20 negative bacteria. 

14. The method of claim 13, wherein said bacteria is selected from the group consisting 
of: Acidarainococcus, Acinetobacter, Acinetobacter lwoffi, Aeromonas, Alcaligenes, 
Bacteroides, Bordetella, Branhamella, Brucella, Calymmatobacterium, Campylobacter, 

25 Cardiobacterium, Chromobacterium, Citrobacter, Citrobacter freundii, Coliform group, 

Edwardsiella, Enterobacter, Enterobacter sakazaki, Enterobacter aerogenes, Enterobacter 
cloacae, Enterobacter agglomerans, Enterococcus, Enterococcus faecalis, Enterococcus 
faecium, Escherichia, Escherichia coli, Escherichia coli-0157, Flavobacterium, Francisella, 
Fusobacterium, Haemophilus, Hafiiia alvei, Klebsiella, Klebsiella oxytoca, Klebsiella 

30 pneumoniae, Legionella, Moraxella, Morganella, Morganella morganii, Neisseria, 

Pasturella, Plesiomonas, Proteus, Providencia, Proteus mirabilis, Pseudomonas, 
Pseudomonas aeruginosa, Salmonella, Salmonella typhimurium, Serratia, Serratia 
marcescens, Shigella, Shigella flexneri, Streptobacillus, Veillonella, Vibrio, Vibrio cholera, 
Yersinia, Yersinnia entercolitica, Xanthomanas maltophiia, Staphylococcus, 

35 Staphylococcus albus, Staphylococcus aureus, Streptococcus, Streptococcus dysgalacticae, 
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Micrococcus, Peptococcus, Peptostreptococcus, Bacillus, Bacillus cereus, Clostridium, 
Lactobacillus, Listeria, Listeria monocytogenes, Erysipelothrix, Propionibacterium, 
Eubacterium, and Corynebacterium. 

15. The method of claim 11, wherein said different species is a virus. 

16. The method of claim 1 5, wherein said virus is a DNA virus or an RNA virus. 

17. The method of claim 15, wherein said virus is selected from the group consisting 
of: retrovirus, pathogenic virus, non-pathogenic virus, drug-resistant virus, drug-sensitive 
virus, adeno-associated virus, bird flu virus, cauliflower mosaic virus, cytomegalovirus 
(CMV) 5 dengue virus, Epstein-Barr virus, feline leukemia virus, flavivirus, haemophilus 
influenza, hemorrhagic fever viruses, hepatitis virus including hepatitis A, B, C, and E, 
viruses, herpes simplex virus, human herpesvirus type A and B, human immunodeficiency 
virus (HIV), human papilloma virus, human T-cell lymphotrophic virus, HTLV Type I, 
HTLV Type II, influenza virus, Japanese encephalitis virus, moraxella catarrhalis, non- 
typeable haemophilus, reovirus, parainfluenza, parvovirus, papova virus, Respiratory 
syncytial virus, Rubella virus, rotavirus, SARS, tomato bushy stunt virus, varicella-zoster 
virus, and vaccinia virus. 

1 8. The method of claim 1 1, wherein said different species is a fungus. 

19. The method of claim 18, wherein said fungus is a drug-sensitive fungus or a drug- 
resistant fungus. 

20. The method of claim 18, wherein said fungus is selected from the group consisting 
of: Candida, Candida albicans, Candida tropicalis, Candida parapsilosis, Candida 
stellatoidea, Candida krusei, Candida parakrusei, Candida lusitanae, Candida 
pseudotropicalis, Candida guilliermondi, Candida glabrata, Aspergillus, Aspergillus 
fumigatus, Aspergillus flavus, Aspergillus niger, Aspergillus nidulans, Aspergillus terreus, 
Aspergillus sydowi, Aspergillus flavatus, Aspergillus glaucus, Cryptococcus, Histoplasma, 
Coccidioides, Paracoccidioides, Blastomyces, Basidiobolus, Conidiobolus, Rhizopus, 
Rhizomucor, Mucor, Absidia, Mortierella, Cunninghamella, Saksenaea, Pseudallescheria, 
Sporotrichosis, Fusarium, Trichophyton, Trichosporon, Microsporum, Epidermophyton, 
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Scytalidium, Malassezia, Actinomycetes, Sporothrix, Penicillium, Saccharomyces and 
Pneumocystis. 

21 . The method of claim 1, wherein isolation of nucleic acid comprises a centrifugation 
step. 

22. The method of claim 21, wherein the centrifugation step is performed with the 
centrifuge braking power set to zero. 

23 . The method of claim 21, wherein the centrifugation step is performed at a speed 
selected from the group consisting of 0-50 rpm, 50-100 rpm, 100-200 rpm, 200-300 rpm, 
300-400 rpm, 400-500 rpm, 500-600 rpm, 600-700 rpm, 700-800 rpm, 800-900 rpm, 900- 
1000 rpm, 1000-2000 rpm, 2000-3000 rpm, 3000-4000 rpm, 4000-5000 rpm, 5000-6000 
rpm, 6000-7000 rpm, 7000-8000 rpm, and greater than 8000 rpm. 

24. A composition comprising mutant DNA and normal DNA, wherein the percentage 
of free mutant DNA in the total free DNA of the composition is selected from the group 
consisting of: about 5-10% free mutant DNA, about 10-13% free mutant DNA, about 13- 
15% free mutant DNA, about 15-16% free mutant DNA, about 16-17% free mutant DNA, 
about 17-18% free mutant DNA, about 18-19% free mutant DNA, about 19-20% free 
mutant DNA, about 20-21% free mutant DNA, about 21-22% free mutant DNA, about 22- 
23% free mutant DNA, about 23-24% free mutant DNA, about 24-25% free mutant DNA, 
about 25-35% free mutant DNA, about 35-45% free mutant DNA, about 45-55% free 
mutant DNA, about 55-65% free mutant DNA, about 65-75% free mutant DNA, about 75- 
85% free mutant DNA, about 85-90% free mutant DNA, about 90-91% free mutant DNA, 
about 91-92% free mutant DNA, about 92-93% free mutant DNA, about 93-94% free 
mutant DNA, about 94-95% free mutant DNA, about 95-96% free mutant DNA, about 96- 
97% free mutant DNA, about 97-98% free mutant DNA, about 98-99% free mutant DNA, 
and about 99-99.7% free mutant DNA. 

25. A diagnostic method comprising analyzing a composition comprising mutant DNA 
and normal DNA, wherein the percentage of free mutant DNA in the total free DNA of the 
composition is selected from the group consisting of: about 5-10% free mutant DNA, about 
10-13% free mutant DNA, about 13-15% free mutant DNA, about 15-16% free mutant 
DNA, about 16-17% free mutant DNA, about 17-18% free mutant DNA, about 18-19% 
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free mutant DNA, about 19-20% free mutant DNA, about 20-21% free mutant DNA, about 
21-22% free mutant DNA, about 22-23% free mutant DNA, about 23-24% free mutant 
DNA, about 24-25% free mutant DNA, about 25-35% free mutant DNA, about 35-45% 
free mutant DNA, about 45-55% free mutant DNA, about 55-65% free mutant DNA, about 
5 65-75% free mutant DNA, about 75-85% free mutant DNA, about 85-90% free mutant 

DNA, about 90-91% free mutant DNA, about 91-92% free mutant DNA, about 92-93% 
free mutant DNA, about 93-94% free mutant DNA, about 94-95% free mutant DNA, about 
95-96% free mutant DNA, about 96-97% free mutant DNA, about 97-98% free mutant 
DNA, about 98-99% free mutant DNA, and about 99-99.7% free mutant DNA 

10 

26. The method of claim 1, wherein said method is used to detect, diagnose, or monitor 
a disease. 

27. The method of claim 26, wherein said disease is cancer. 

15 

28. The method of claim 27, wherein said cancer is selected from the group consisting 
of: carcinoma of the bladder, breast, bronchial, colon, kidney, liver, lung, esophagus, gall- 
bladder, ovary, pancreas, stomach, cervix, thyroid, prostate, and skin; small cell lung 
cancer, squamous cell carcinoma, hematopoietic tumors of lymphoid lineage, leukemia, 

20 acute lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma, T-cell- 

lymphoma, Hodgkin's lymphoma, non-Hodgkin ! s lymphoma, hairy cell lymphoma, 
Burkett's lymphoma, hematopoietic tumors of myeloid lineage, acute and chronic 
myelogenous leukemias, myelodysplastic syndrome and promyelocyte leukemia, tumors of 
mesenchymal origin, fibrosarcoma and rhabdomyosarcoma, tumors of the central and 

25 peripheral nervous system, astrocytoma, neuroblastoma, glioma and schwannomas, 

melanoma, seminoma, teratocarcinoma, osteosarcoma, xenoderoma pigmentosum, 
keratoctanthoma, thyroid follicular cancer and Kaposi's sarcoma. 

29. The method of claim 1, wherein said method is used to monitor response to 
30 treatment. 

30. The method of claim 29, wherein the treatment is selected from the group 
consisting of surgery, radiation, lifestyle change, dietary protocol, supplementation and 
administration of a drug. 

35 
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3 1 . The method of claim 30, wherein the treatment is administration of a drug. 

32. The method of claim 31, wherein said drug is selected from the group consisting of: 
chemotherapeutic agents, anti-bacterial agents, anti-viral agents, anti-fungal agents, 
targeted-cancer drugs, cytoxoic agents, cytostatic agents, anti-proliferative agents, Avastin, 
altretamine, busulfan, chlorambucil, cyclophosphamide, Erbitux, Rituxan, ifosfamide, 
mechlorethamine, melphalan, thiotepa, cladribine, fluorouracil, floxuridine, gemcitabine, 
thioguanine, pentostatin, methotrexate, 6-mercaptopurine, cytarabine, carmustine, 
lomustine, streptozotocin, carboplatin, cisplatin, oxaliplatin, iproplatin, tetraplatin, 
lobaplatin, JM216, JM335, fludarabine, aminoglutethimide, flutamide, goserelin, 
leuprolide, megestrol acetate, cyproterone acetate, tamoxifen, anastrozole, bicalutamide, 
dexamethasone, diethylstilbestrol, prednisone, bleomycin, dactinomycin, daunorubicin, 
doxirubicin, idarubicin, mitoxantrone, losoxantrone, mitomycin-c, plicamycin, paclitaxel, 
docetaxel, CPT-1 1, epothilones , topotecan, irinotecan, 9-amino camptothecan, 9-nitro 
camptothecan, GS-21 1, etoposide, teniposide, vinblastine, vincristine, vinorelbine, 
procarbazine, asparaginase, pegaspargase, methoxtrexate, octreotide, estramustine, and 
hydroxyurea 

33 . The method of claim 1, wherein said nucleic acid is free nucleic acid. 
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FIG. 13 
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